Abstract. On the basis of new, concise foundations, this paper establishes the four laws of
thermodynamics, the Maxwell relations, and the stability requirements for response functions,
in a form applicable to global (homogeneous), local (hydrodynamic) and microlocal
(kinetic) equilibrium.

The present, self-contained treatment needs very little formal machinery and stays very 
close to the formulas as they are applied by the practicing physicist, chemist, or engineer. 
From a few basic assumptions, the full structure of phenomenological thermodynamics and 
of classical and quantum statistical mechanics is recovered. 

Care has been taken to keep the foundations free of subjective aspects (which traditionally 
creep in through information or probability). One might describe the paper as a uniform 
treatment of the nondynamical part of classical and quantum statistical mechanics "with- 
out statistics" (i.e., suitable for the definite descriptions of single objects) and "without 
mechanics" (i.e., independent of microscopic assumptions). When enriched by the tradi- 
tional examples and applications, this paper may serve as the basis for a course on thermal 
physics. 
1 Introduction



It is worthwhile to have the structural properties of physical theories spelled out inde- 
pendent of computational techniques for obtaining quantitatively correct numbers from 
the assumptions made. This allows one to focus attention on the simplicity and beauty 
of theoretical physics, which is often hidden in a jungle of techniques for estimating or 
calculating approximations for quantities of interest. The standard approximation machinery
for calculating explicit thermodynamic properties of materials from first principles
can be postponed to a later stage where one actually wants to do quantitative predictions 
of macroscopic properties from microscopic assumptions. 

In the present paper, a treatment of equilibrium statistical mechanics and the kinematic 
part of nonequilibrium statistical mechanics is given which shows that such a structural 
point of view leads to great conceptual simplifications: From a single basic assumption
(Definition 7.1), the full structure of phenomenological thermodynamics and of statistical
mechanics is recovered (except for the third law which requires an additional quantization 
assumption), compatible with both a classical or a quantum microscopic view, but without 
the elaborate formalism needed to do actual statistical mechanics calculations. (In this 
paper, these are only hinted at, as a comment on the cumulant expansion.) 

Our treatment is self-contained and stays very close to the formulas as they are applied 
by the practicing physicist, chemist, or engineer. It needs very little formal machinery, in 
contrast to other axiomatic settings for thermodynamics (e.g., Lieb & Yngvason [80]).
A different, well-known axiomatic foundation of (classical) thermodynamics was given by
Carathéodory [28]. See also Emch & Liu [H].

In contrast to traditional treatments of thermodynamics (e.g., Haken [27], Jaynes [64],
Katz [TD]), we do not assume the basic thermodynamical laws and relations but derive them
from simple definitions. In our treatment, the zeroth law is made precise as a definition, and 
each of the other three laws becomes a theorem. All formulas are valid rigorously without 
approximation, assuming only that the states in question are thermal states. 

Since thermodynamics is completely observer independent, its foundations should have this 
feature, too. Therefore, care has been taken to keep the foundations free of subjective 
aspects, traditionally introduced by an information theoretic approach (see Balian P] for 
a recent exposition) to statistical mechanics in terms of the knowledge of an observer. The 
present approach shows that (and in which sense) a fully objective, observer-independent 
foundation is possible. 

One might describe the paper as a uniform treatment of the nondynamical part of classical 
and quantum statistical mechanics "without statistics and probability" (i.e., suitable for 
the definite descriptions of single objects) and "without mechanics" (i.e., independent of 
microscopic assumptions). Essentially, the paper develops all relevant results without assuming
or claiming anything definite about the meaning of the quantities assumed to exist
on the fundamental description level. Thus the interpretation of an underlying microscopic
structure is left completely open; even the 'number of particles' can be taken figuratively! 

Avoiding a definite mechanical picture frees the foundations from the unresolved (and in 
principle unresolvable) issues regarding the truly fundamental constituents of nature and 
possibly needed modifications of quantum field theory at the Planck scale where gravita- 
tional effects must be taken into account. Avoiding probability implies that the traditional 
foundational problems of quantum mechanics - the most informative sources include Stapp
[118], Ballentine [in], Home & Whitaker [60], Peres & Terno [101] and the reprint
collection by Wheeler & Zurek [129] - and of equilibrium statistical mechanics, carefully
analyzed by Sklar [115] for classical mechanics (see also Grandy [52] and the older
surveys by Ehrenfest [39], ter Haar [121], Penrose [99]), and by Wallace [127] for
quantum mechanics, do not affect the foundations of thermodynamics.

This is one of a sequence of papers designed to give a mathematically precise and philosophically
consistent foundation of modern theoretical physics. The ultimate goal is a foundation
of physics which is fully objective, mathematically concise, and describes the universe as
a whole, including at times when human observers could not exist. The first paper in the
series (Neumaier [93]) defines and discusses the concepts of quantities, ensembles, and
experiments in classical and quantum physics.

On this basis (but formally independent), the present paper derives the standard formalism 
of equilibrium thermodynamics and statistical mechanics, formulated in a way that includes 
global equilibrium (homogeneous materials), local equilibrium (hydrodynamics and contin- 
uum mechanics), microlocal equilibrium (kinetic descriptions), and quantum equilibrium 
(microscopic descriptions). In this paper, only the kinematical level is discussed; another 
paper (Neumaier [91]) on the dynamics of thermal states is in preparation. The formalism 
developed is closely related to the theory of exponential families (cf. Barndorff-Nielsen 
[II1II2], Bernardo & Smith [T7]), although the language used there is completely differ- 
ent. 

The physics of our results is, of course, well-known since Gibbs [H]; see Uffink [126]
for a history. However, the treatment of the material is new, as is the careful set-up
used to arrive at the standard results. In particular: We make precise (in Section [2]) to
which extent thermodynamical variables have a meaning outside equilibrium, thus ensuring a
formally well-defined second law. We give (in Section [6]) a new interpretation of the notion of
an ensemble, valid for single macroscopic systems, whose properties are well-defined without
reference to multiple realizations, and clarify the role of statistics in the foundations. We
give (in Sections [7], [10], [11] and [12]) concise formal statements of the fundamental laws
of thermodynamics. Finally, we demonstrate (in Section [12]) that the conventional wave
function form of quantum mechanics can be simply understood as the low temperature
limit of thermodynamics.
The paper is organized as follows: To set the physical background needed to appreciate the
remainder of the paper, Section [2] and Section [3] give a description of standard phenomenological
equilibrium thermodynamics for single-phase systems in the absence of chemical
reactions and electromagnetic fields. The present, unconventional approach summarizes
the assumptions needed in a way that allows an easy comparison with statistical mechanics.
Section [4] introduces Gibbs states, in a generality covering both the classical and the
quantum case; their abstract properties are studied in Section [5]. Section [6] discusses the
role of statistics in our approach. Section [7] defines thermal states and discusses their relevance
for global, local, and microlocal equilibrium. Section [8] deduces the existence of an
equation of state and connects the results to the phenomenological exposition in Section
[2]. Section [9] discusses questions relating to different thermal description levels. Section [10]
proves the first law of thermodynamics. In Section [11], we compare thermal states with
arbitrary Gibbs states and deduce the extremal principles of the second law. Section [12]
shows that the third law is related to a simple quantization condition for the entropy and
relates it to the time-independent Schrödinger equation. Section [13] discusses the hierarchy
of equilibrium descriptions and how they relate to each other.

Appendix [A] exhibits the relations to information theory, by putting a simple decision
problem into our framework. In this example, we recover the usual interpretation of the 
value of the entropy as a measure of unobservable internal complexity (lack of information). 
Appendix [B] discusses the maximum entropy principle and its limitations. Appendix [C] 
contains the proofs of a few auxiliary results needed in the main text. 

In this paper, we shall denote quantities with a specific meaning by capital letters, con- 
forming to the tradition in thermodynamics. 

The level of formal precision is somewhere between that of mathematical physics and that of 
most treatments of theoretical physics. I made my best efforts to carefully argue all relevant 
issues and to make explicit all assumptions used in these arguments. Definitions, theorems,
and proofs are stated precisely. However, to avoid an excessive mathematical apparatus we 
do not specify the domains of definition of functions, and we state but do not use axioms 
relating to continuity properties and similar things, though these would be necessary for 
a completely rigorous treatment. If one assumes that the algebra of quantities is finite- 
dimensional, all proofs are valid rigorously as stated. For the infinite dimensional case, 
these arguments need further justification. However, assuming the results in Appendix [C],
all other results still follow rigorously. (In fact, it can be shown that everything would hold 
rigorously when this algebra is embeddable into a C*-algebra. But the applications require 
certain unbounded operators to occur. Moreover, unlike our nearly trivial axioms, the 
C*-algebra axioms have no intuitive physical basis, but are abstracted from mathematical 
properties of algebras of bounded operators on Hilbert spaces. Thus, we do not use the 
C*-algebra framework.) 

Acknowledgment. Thanks to Roger Balian, Clemens Elster, Mike Mowbray, Hermann 
Schichl and Tapio Schneider for useful comments on earlier versions of this manuscript, and 
to Tom Epperly who, many years ago, introduced me to thermodynamics.



2 Phenomenological thermodynamics 

To introduce the physical background for our developments, we discuss thermodynamic 
systems which describe the single-phase global equilibrium of matter in the absence of 
chemical reactions and electromagnetic fields. We call such systems standard thermo- 
dynamic systems; they are ubiquitous in the applications. In particular, a standard 
system is considered to be uncharged, homogeneous, and isotropic, so that each finite re- 
gion looks like any other and is very large in microscopic units. (Multiple phases would 
be only piecewise homogeneous; each phase separately is described as indicated, but dis- 
cussing the equilibrium at interfaces needs some additional effort, described in all textbooks 
on thermodynamics.) 

Our phenomenological approach is similar to that of Callen [26], a widely accepted authority
on equilibrium thermodynamics, who introduces the basic concepts by means of a
few postulates from which everything else follows. The present setting has the advantage
that it easily matches the more fundamental approach based on statistical mechanics. Our
axioms are also slightly more precise than Callen's in that they specify the kinematical
properties of states outside equilibrium. This enables us to replace his informal thermodynamic
stability arguments (which depend on dynamical assumptions close to equilibrium)
by rigorous mathematical arguments.

The species of molecules of fixed chemical composition are labeled by an index $j \in J$; we
speak of particles of species $j$. A standard thermodynamic system is characterized by
the number $N_j$ of particles of each species $j$, the corresponding chemical potential
$\mu_j$ of species $j$, the volume V, the pressure P, the temperature T, the entropy S,
and the Hamilton energy H. These variables, the extensive variables $N_j$, V, S, H and
the intensive variables $\mu_j$, P, T, are jointly called the basic thermodynamic variables.
In the terminology, we mainly follow the IUPAC convention (Alberty [1, Section 7]),
except that we use the letter H to denote the Hamilton energy, as customary in quantum
mechanics. In equilibrium, H equals the internal energy. The Hamilton energy should
not be confused with the enthalpy which (is usually denoted by H but here) is given in
equilibrium by H + PV. (For a history of thermodynamics notation, see Battino et al.

m) 

As customary in thermodynamics, we allow the $N_j$ to have arbitrary nonnegative real
values, not only integers. (Chemists use instead of particle numbers the corresponding mole
numbers, which differ by a fixed numerical factor, the Avogadro constant; cf. Example 2.3.)
We group the $N_j$ and the $\mu_j$ into vectors $N$ and $\mu$ indexed by $J$ and write
$\mu \cdot N = \sum_{j \in J} \mu_j N_j$.
If there is just a single species, we drop the indices and have $\mu \cdot N = \mu N$. In this section,
all numbers are real.
Equilibrium thermodynamics is about characterizing so-called equilibrium states and comparing
them with similar nonequilibrium states. Everything of relevance in equilibrium
thermodynamics can be deduced from the following 

2.1 Definition. (Axioms for phenomenological thermodynamics) 

(i) Temperature T, pressure P, volume V are positive, particle numbers $N_j$ are nonnegative.
There is a convex¹ system function $\Delta$ of the intensive variables $(T, P, \mu)$ which is
monotone increasing in T and monotone decreasing in P, such that the intensive variables
are related by the equation of state

$$\Delta(T, P, \mu) = 0. \tag{1}$$

The set of $(T, P, \mu)$ satisfying $T > 0$, $P > 0$ and the equation of state is called the state
space.

(ii) Outside equilibrium, only the extensive variables are well-defined; they are additive
under the composition of disjoint subsystems. The Hamilton energy H satisfies the Euler
inequality

$$H \ge TS - PV + \mu \cdot N \tag{2}$$

for all $(T, P, \mu)$ in the state space. Equilibrium states have well-defined intensive and
extensive variables, and are defined by equality in (2).



This is the complete list of assumptions which define phenomenological equilibrium thermodynamics;
the system function $\Delta$ can be determined either by fitting to experimental data,
or by calculation from more fundamental descriptions, cf. Theorem 8.1. All other properties
follow from the system function. Thus, the equilibrium properties of a material (e.g., "salty
water", J = {salt, water}) are characterized by the system function $\Delta$; each equilibrium
instance of the material ("this glass of salty water") is characterized by a particular state
(T, P, N), from which all equilibrium properties can be computed:

2.2 Theorem. 

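(i) In any equilibrium state, the extensive variables are given by

$$S = \Omega \frac{\partial \Delta}{\partial T}(T, P, \mu), \quad V = -\Omega \frac{\partial \Delta}{\partial P}(T, P, \mu), \quad N = \Omega \frac{\partial \Delta}{\partial \mu}(T, P, \mu), \tag{3}$$

and the Euler equation

$$H = TS - PV + \mu \cdot N, \tag{4}$$

where $\Omega$ is a positive number called the system size.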



¹ See Appendix [C] for convexity and the associated differentiability almost everywhere. Surfaces of
nondifferentiability correspond to so-called phase transitions. Although practically important and hence
discussed in every textbook of thermodynamics, phase transitions do not raise foundational problems.
Therefore, we look in this paper only at regions of state space where $\Delta$ is twice continuously differentiable.
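(ii) In equilibrium, we have the Maxwell reciprocity relations

$$-\frac{\partial V}{\partial T} = \frac{\partial S}{\partial P}, \quad \frac{\partial N_j}{\partial T} = \frac{\partial S}{\partial \mu_j}, \quad \frac{\partial N_j}{\partial P} = -\frac{\partial V}{\partial \mu_j}, \quad \frac{\partial N_j}{\partial \mu_k} = \frac{\partial N_k}{\partial \mu_j}, \tag{5}$$

and the stability conditions

$$\frac{\partial S}{\partial T} \ge 0, \quad \frac{\partial V}{\partial P} \le 0, \quad \frac{\partial N_j}{\partial \mu_j} \ge 0. \tag{6}$$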



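Proof. In equilibrium, at fixed S, V, N, the triple $(T, P, \mu)$ is a maximizer of $TS - PV + \mu \cdot N$
under the constraints $\Delta(T, P, \mu) = 0$, $T > 0$, $P > 0$. A necessary condition for a maximizer
is the stationarity of the Lagrangian

$$L(T, P, \mu) = TS - PV + \mu \cdot N - \Omega \Delta(T, P, \mu)$$

for some Lagrange multiplier $\Omega$. Setting the partial derivatives to zero gives (3), and since
the maximum is attained in equilibrium, the Euler equation (4) follows. The system size $\Omega$
is positive since $V > 0$ and $\Delta$ is decreasing in P. Since the Hessian matrix of $\Delta$,

$$\Sigma = \begin{pmatrix}
\dfrac{\partial^2 \Delta}{\partial T^2} & \dfrac{\partial^2 \Delta}{\partial P\, \partial T} & \dfrac{\partial^2 \Delta}{\partial \mu\, \partial T} \\
\dfrac{\partial^2 \Delta}{\partial T\, \partial P} & \dfrac{\partial^2 \Delta}{\partial P^2} & \dfrac{\partial^2 \Delta}{\partial \mu\, \partial P} \\
\dfrac{\partial^2 \Delta}{\partial T\, \partial \mu} & \dfrac{\partial^2 \Delta}{\partial P\, \partial \mu} & \dfrac{\partial^2 \Delta}{\partial \mu^2}
\end{pmatrix}
= \Omega^{-1} \begin{pmatrix}
\dfrac{\partial S}{\partial T} & \dfrac{\partial S}{\partial P} & \dfrac{\partial S}{\partial \mu} \\
-\dfrac{\partial V}{\partial T} & -\dfrac{\partial V}{\partial P} & -\dfrac{\partial V}{\partial \mu} \\
\dfrac{\partial N}{\partial T} & \dfrac{\partial N}{\partial P} & \dfrac{\partial N}{\partial \mu}
\end{pmatrix},$$

is symmetric, the Maxwell reciprocity relations follow, and since $\Delta$ is convex, $\Sigma$ is positive
semidefinite; hence the diagonal elements of $\Sigma$ are nonnegative, giving the stability
conditions. □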



Note that there are further stability conditions since the determinants of all principal
submatrices of $\Sigma$ must be nonnegative. Also, $N_j \ge 0$ implies that $\Delta$ is monotone increasing in
each $\mu_j$.
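As a plausibility check of the Maxwell relations (5) and the stability conditions (6), the following minimal sketch evaluates the second matrix of the proof by finite differences and tests its symmetry and principal minors. It is only an illustration under stated assumptions: it uses the single-species ideal-gas system function of Example 2.3 with the hypothetical choice $\pi(T) = T^{5/2}$, reduced units in which the Boltzmann constant equals 1, and an arbitrary system size. All function and variable names are illustrative and not part of the paper.

```python
import math
from itertools import combinations

KBAR = 1.0    # Boltzmann constant in reduced units (assumption)
OMEGA = 5.0   # system size, arbitrary positive value (assumption)

def weight(T, mu):
    # pi(T) * exp(mu / (kbar T)) with the hypothetical choice pi(T) = T**2.5
    return T ** 2.5 * math.exp(mu / (KBAR * T))

# Extensive variables from (3), differentiated by hand for this system function.
def S(T, P, mu):
    return OMEGA * weight(T, mu) * (2.5 / T - mu / (KBAR * T * T))

def V(T, P, mu):
    return OMEGA  # equals -OMEGA * dDelta/dP, since Delta is linear in P

def N(T, P, mu):
    return OMEGA * weight(T, mu) / (KBAR * T)

def partial(f, x, i, h=1e-5):
    # central finite difference of f with respect to its i-th argument
    up, dn = list(x), list(x)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)

def sigma(x):
    # Hessian of Delta, written as OMEGA^{-1} times the derivatives of (S, -V, N)
    rows = (S, lambda *a: -V(*a), N)
    return [[partial(f, x, i) / OMEGA for i in range(3)] for f in rows]

def det(m):
    # Laplace expansion; adequate for the tiny matrices used here
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** c * m[0][c] * det([r[:c] + r[c + 1:] for r in m[1:]])
               for c in range(len(m)))

def principal_minors(m):
    n = len(m)
    return [det([[m[i][j] for j in idx] for i in idx])
            for size in range(1, n + 1)
            for idx in combinations(range(n), size)]

T0, MU0 = 2.0, -3.0
x = (T0, weight(T0, MU0), MU0)   # (T, P, mu) on the state space: P solves Delta = 0
sig = sigma(x)
asymmetry = max(abs(sig[i][j] - sig[j][i]) for i in range(3) for j in range(3))
minors = principal_minors(sig)
print("largest asymmetry:", asymmetry)
print("smallest principal minor:", min(minors))
```

In this example the row and column belonging to P vanish because the ideal-gas system function is linear in P, so several principal minors are zero; this is the boundary case permitted by the weak inequalities in (6).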

2.3 Example. The equilibrium behavior of electrically neutral gases at sufficiently low
pressure can be modelled as ideal gases. An ideal gas is defined by a system function of
the form

$$\Delta(T, P, \mu) = \sum_j \pi_j(T)\, e^{\mu_j/\bar{k}T} - P, \tag{7}$$

where the $\pi_j(T)$ are positive functions of the temperature, $\bar{k}$ is the Boltzmann constant
and we use the bracketing convention $\mu_j/\bar{k}T = \mu_j/(\bar{k}T)$. The Boltzmann constant defines
the units in which the entropy is measured. By a change of units one can enforce any value
of $\bar{k}$. In traditional macroscopic units, the Boltzmann constant is very tiny,

$$\bar{k} \approx 1.38065 \cdot 10^{-23}\,\mathrm{J/K}. \tag{8}$$

(For this and other constants, see CODATA [36]. In analogy with Planck's constant $\hbar$, we
write $\bar{k}$ in place of the customary $k$ or $k_B$, in order to be free to use the letter $k$ for other
purposes.) Differentiation with respect to P shows that $\Omega = V$ is the system size, and from
(1), (3), and (15), we find that, in equilibrium,

$$P = \sum_j \pi_j(T)\, e^{\mu_j/\bar{k}T}, \quad S = V \sum_j \Big(\frac{\partial}{\partial T}\pi_j(T) - \frac{\mu_j \pi_j(T)}{\bar{k}T^2}\Big) e^{\mu_j/\bar{k}T},$$

$$N_j = \frac{V \pi_j(T)}{\bar{k}T}\, e^{\mu_j/\bar{k}T}, \quad H = V \sum_j \Big(T \frac{\partial}{\partial T}\pi_j(T) - \pi_j(T)\Big) e^{\mu_j/\bar{k}T}.$$

Expressed in terms of T, V, N, we have

$$PV = \bar{k}T \sum_j N_j, \quad \mu_j = \bar{k}T \log \frac{\bar{k}T N_j}{V \pi_j(T)},$$

$$H = \sum_j h_j(T) N_j, \quad h_j(T) = \bar{k}T \Big(T \frac{\partial}{\partial T} \log \pi_j(T) - 1\Big),$$

from which S can be computed by means of the Euler equation (4). In particular, for
one mole of a single substance, defined by $N = R/\bar{k} \approx 6.02214 \cdot 10^{23}$ (the Avogadro
constant), where $R \approx 8.31447$ J/K is the molar gas constant, we get the ideal gas law

$$PV = RT; \tag{9}$$

cf. Clapeyron [33], Jensen
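The relations of this example can be checked numerically. The sketch below differentiates the ideal-gas system function (7) by central finite differences to obtain S, V, N from (3), and then compares the results with the closed-form expressions. It assumes a single species, the hypothetical choice $\pi(T) = T^{5/2}$ (any positive function could be substituted), and reduced units with Boltzmann constant 1; the helper names are illustrative only.

```python
import math

KBAR = 1.0  # Boltzmann constant in reduced units (assumption)

def pi_gas(T):
    # hypothetical choice pi(T) = T**2.5; any positive function of T would do
    return T ** 2.5

def delta(T, P, mu):
    # ideal-gas system function (7) for a single species
    return pi_gas(T) * math.exp(mu / (KBAR * T)) - P

def partial(f, args, i, h=1e-6):
    # central finite-difference approximation of the i-th partial derivative
    up, dn = list(args), list(args)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)

def equilibrium_state(T, mu, omega):
    # returns (P, S, V, N, H) for given T, mu and system size omega
    P = pi_gas(T) * math.exp(mu / (KBAR * T))   # solves delta(T, P, mu) = 0, eq. (1)
    x = (T, P, mu)
    S = omega * partial(delta, x, 0)            # eq. (3)
    V = -omega * partial(delta, x, 1)           # eq. (3)
    N = omega * partial(delta, x, 2)            # eq. (3)
    H = T * S - P * V + mu * N                  # Euler equation (4)
    return P, S, V, N, H

T = 2.0
P, S, V, N, H = equilibrium_state(T=T, mu=-3.0, omega=5.0)
h_of_T = KBAR * T * (2.5 - 1.0)   # h(T) = kbar T (T dlog(pi)/dT - 1) for pi = T**2.5
print("V equals the system size:", V)
print("PV - kbar T N:", P * V - KBAR * T * N)
print("H - h(T) N:", H - h_of_T * N)
```

Both printed differences should vanish up to finite-difference error, reflecting $PV = \bar{k}TN$ and $H = h(T)N$.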



In general, the difference $h_j(T) - h_j(T_0)$ can be found experimentally by measuring the
energy needed for raising or lowering the temperature of pure substance j from $T_0$ to T
while keeping the $N_j$ constant. In terms of infinitesimal increments, the heat capacities

$$C_j(T) = dh_j(T)/dT,$$

we have

$$h_j(T) = h_j(T_0) + \int_{T_0}^{T} dT\, C_j(T).$$

From the definition of $h_j(T)$, we find that

$$\pi_j(T) = \pi_j(T_0) \exp \int_{T_0}^{T} \frac{dT}{T} \Big(1 + \frac{h_j(T)}{\bar{k}T}\Big).$$
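As a worked illustration of the last formula, consider the special case of a temperature-independent heat capacity $C_j$ (an assumption made here only for the example; the text allows arbitrary $C_j(T)$). Then $h_j$ is linear in $T$ and the integral can be evaluated in closed form:

```latex
\begin{align*}
h_j(T) &= h_j(T_0) + C_j\,(T - T_0), \\
\int_{T_0}^{T} \frac{dT'}{T'}\Big(1 + \frac{h_j(T')}{\bar{k}T'}\Big)
  &= \Big(1 + \frac{C_j}{\bar{k}}\Big)\log\frac{T}{T_0}
   + \frac{h_j(T_0) - C_j T_0}{\bar{k}}\Big(\frac{1}{T_0} - \frac{1}{T}\Big), \\
\pi_j(T) &= \pi_j(T_0)\,\Big(\frac{T}{T_0}\Big)^{1 + C_j/\bar{k}}
   \exp\Big[\frac{h_j(T_0) - C_j T_0}{\bar{k}}\Big(\frac{1}{T_0} - \frac{1}{T}\Big)\Big].
\end{align*}
```

For instance, with $C_j = \tfrac{3}{2}\bar{k}$ and $h_j(T_0) = \tfrac{3}{2}\bar{k}T_0$ the exponential factor equals 1 and $\pi_j(T)$ is proportional to $T^{5/2}$. The two constants $\pi_j(T_0)$ and $h_j(T_0)$ remain free in this expression.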

Thus there are two undetermined integration constants for each particle species. These
cannot be determined experimentally as long as we are in the range of validity of the ideal
gas approximation. Indeed, if we pick arbitrary constants $p_j$ and $q_j$ and replace $\pi_j(T)$, $\mu_j$, H,
and S by

$$\pi'_j(T) := e^{p_j - q_j/\bar{k}T}\, \pi_j(T), \quad \mu'_j = \mu_j + q_j - \bar{k}T p_j,$$

$$H' = H + \sum_j q_j N_j, \quad S' = S + \bar{k} \sum_j p_j N_j,$$

all relations remain unchanged. Thus, the Hamilton energy and the entropy of an ideal gas
are only determined up to an arbitrary linear combination of the particle numbers. (This
is an instance of the deeper problem to determine under which conditions thermodynamic
variables are controllable; cf. the discussion in the context of Example 9.1 below.) In
practice, this gauge freedom (present only in the ideal gas) can be fixed by choosing a
particular standard temperature $T_0$ and setting arbitrarily $H_j(T_0) = 0$, $\mu_j(T_0) = 0$.
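The gauge freedom just described can be illustrated numerically. The following sketch, for a single species, applies the substitution with constants p and q and confirms that P and N are unchanged while H shifts by qN and S by $\bar{k}pN$. It assumes the hypothetical reference function $\pi(T) = T^{5/2}$, reduced units with Boltzmann constant 1, and arbitrary sample values; none of the names come from the paper.

```python
import math

KBAR = 1.0  # Boltzmann constant in reduced units (assumption)

def pi_ref(T):
    # hypothetical reference choice pi(T) = T**2.5
    return T ** 2.5

def gauged(p, q):
    # transformed function pi'(T) = exp(p - q / (kbar T)) * pi(T)
    return lambda T: math.exp(p - q / (KBAR * T)) * pi_ref(T)

def state(pi, T, mu, V, dT=1e-6):
    # (P, N, H, S) of a single-species ideal gas with system size Omega = V
    w = pi(T) * math.exp(mu / (KBAR * T))
    P = w                                      # equation of state (1)
    N = V * w / (KBAR * T)                     # from (3)
    dlogpi = (math.log(pi(T + dT)) - math.log(pi(T - dT))) / (2 * dT)
    h = KBAR * T * (T * dlogpi - 1.0)          # h(T) as defined in the example
    H = h * N
    S = (H + P * V - mu * N) / T               # Euler equation (4) solved for S
    return P, N, H, S

T, mu, V, p, q = 2.0, -3.0, 5.0, 0.7, -1.3     # arbitrary sample values
P0, N0, H0, S0 = state(pi_ref, T, mu, V)
P1, N1, H1, S1 = state(gauged(p, q), T, mu + q - KBAR * T * p, V)

print("change in P:", P1 - P0)
print("change in N:", N1 - N0)
print("H' - H - q N:", (H1 - H0) - q * N0)
print("S' - S - kbar p N:", (S1 - S0) - KBAR * p * N0)
```

All four printed quantities should be zero up to rounding and finite-difference error.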

The ideal gas law (9) is the basis for the construction of gas thermometers, since the amount
of expansion of volume in a long, thin tube can easily be read off from a scale along the 
tube. We have V = aL, where a is the cross section area and L is the length of the filled 
part of the tube, hence T = {aP/R)L. Thus, at constant pressure, the temperature is 
proportional to L. Modern thermometers work along similar principles. 



3 The laws of thermodynamics 

To measure the temperature of a system, one brings it in thermal contact with a thermome- 
ter and waits until equilibrium is established. (Thermal contact is precisely the condition 
that this will happen in a short amount of time.) The system and the thermometer will 
have the same temperature, which can be read off the thermometer. If the system is much 
larger than the thermometer, this temperature will be essentially the same as the temper- 
ature of the system before the measurement. The possibility of measuring temperature is 
sometimes called the zeroth law of thermodynamics, following Fowler & Guggenheim
[45]. For a survey of the problems involved in defining and measuring temperature outside
equilibrium, see Casas-Vasquez & Jou [29]. For the history of temperature, see Roller
[113] and Truesdell [121].

To be able to formulate the first law of thermodynamics we need the concept of a reversible 
change of states, i.e., changes preserving the equilibrium condition. For use in later sections, 
we define the concept in a slightly more general form, writing $\alpha$ for P and $\mu$ jointly.

3.1 Definition. A state variable is an almost everywhere continuously differentiable
function $\phi(T, \alpha)$ defined on the state space (or a subset of it). Temporal changes in a state
variable that occur when the boundary conditions are kept fixed are called spontaneous
changes.

A reversible transformation is a continuously differentiable mapping

$$\lambda \to (T(\lambda), \alpha(\lambda))$$

from a real interval into the state space; thus $\Delta(T(\lambda), \alpha(\lambda)) = 0$. The differential

$$d\phi = \frac{\partial \phi}{\partial T}\, dT + \frac{\partial \phi}{\partial \alpha} \cdot d\alpha \tag{10}$$

describes the change of a state variable $\phi$ under arbitrary (infinitesimal) reversible transformations.
(Divide by $d\lambda$ and recognize the chain rule! In formal mathematical terms,
differentials are exact 1-forms on the state space manifold.)

Reversible changes per se have nothing to do with changes in time. However, by sufficiently 
slow, quasistatic changes of the boundary conditions, reversible changes can often be real- 
ized approximately as temporal changes. The degree to which this is possible determines 
the efficiency of thermodynamic machines. 

The state space is often parameterized by different sets of state variables, as required by the
application. If $T = T(\kappa, \lambda)$, $\alpha = \alpha(\kappa, \lambda)$ is such a parameterization then the state variable
$g(T, \alpha)$ can be written as a function of $(\kappa, \lambda)$,

$$g(\kappa, \lambda) = g(T(\kappa, \lambda), \alpha(\kappa, \lambda)). \tag{11}$$

This notation, while mathematically ambiguous, is common in the literature; the names of
the arguments decide which function is intended. When writing partial derivatives without
arguments, this leads to serious ambiguities. These can be resolved by writing
$\big(\frac{\partial g}{\partial \lambda}\big)_\kappa$ for
the partial derivative of (11) with respect to $\lambda$; it can be evaluated using (10), giving the
chain rule

$$\Big(\frac{\partial g}{\partial \lambda}\Big)_\kappa = \frac{\partial g}{\partial T} \Big(\frac{\partial T}{\partial \lambda}\Big)_\kappa + \frac{\partial g}{\partial \alpha} \cdot \Big(\frac{\partial \alpha}{\partial \lambda}\Big)_\kappa. \tag{12}$$

Here the partial derivatives in the original parameterization by the intensive variables are
written without parentheses.

Differentiating the equation of state (1), using the chain rule (10), and simplifying using
(3) gives the Gibbs-Duhem equation

$$0 = S\, dT - V\, dP + N \cdot d\mu \tag{13}$$

for reversible changes. If we differentiate the Euler equation (4), we obtain

$$dH = T\, dS + S\, dT - P\, dV - V\, dP + \mu \cdot dN + N \cdot d\mu,$$

and using (13), this simplifies to the first law of thermodynamics (Mayer [86], Joule
[Jou], Helmholtz [Hel], Clausius [34])

$$dH = T\, dS - P\, dV + \mu \cdot dN. \tag{14}$$
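The Gibbs-Duhem equation (13) and the first law (14) can be checked numerically along a sample reversible transformation. The sketch below does this for a single-species ideal gas, dividing all differentials by $d\lambda$ and approximating the derivatives by central differences. The path, the choice $\pi(T) = T^{5/2}$, the reduced units with Boltzmann constant 1, and in particular the decision to let the system size vary along the path (so that the volume changes) are assumptions of this illustration, not prescriptions of the text.

```python
import math

KBAR = 1.0  # Boltzmann constant in reduced units (assumption)

def path(lam):
    # hypothetical smooth path: temperature, chemical potential, system size
    T = 2.0 + 0.5 * lam
    mu = -3.0 + 0.2 * math.sin(lam)
    omega = 5.0 + lam * lam
    return T, mu, omega

def state(lam):
    # all thermodynamic variables of the ideal gas along the path
    T, mu, omega = path(lam)
    w = T ** 2.5 * math.exp(mu / (KBAR * T))        # pi(T) e^{mu / kbar T}
    P = w                                           # equation of state (1)
    S = omega * w * (2.5 / T - mu / (KBAR * T * T)) # eq. (3)
    V = omega                                       # eq. (3), Delta linear in P
    N = omega * w / (KBAR * T)                      # eq. (3)
    H = T * S - P * V + mu * N                      # Euler equation (4)
    return {"T": T, "P": P, "mu": mu, "S": S, "V": V, "N": N, "H": H}

def ddlam(name, lam, h=1e-5):
    # central-difference derivative of one variable with respect to lambda
    return (state(lam + h)[name] - state(lam - h)[name]) / (2 * h)

def residuals(lam):
    s = state(lam)
    d = {name: ddlam(name, lam) for name in s}
    gibbs_duhem = s["S"] * d["T"] - s["V"] * d["P"] + s["N"] * d["mu"]
    first_law = d["H"] - (s["T"] * d["S"] - s["P"] * d["V"] + s["mu"] * d["N"])
    return gibbs_duhem, first_law

for lam in (0.0, 0.3, 1.0):
    print(lam, residuals(lam))
```

Both residuals should be zero up to finite-difference error at every point of the path.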

Considering global equilibrium from a fundamental point of view, the extensive variables 
are the variables that are conserved (or at least change so slowly that they may be regarded 
as time independent on the time scale of interest). In the absence of chemical reactions, 
the particle numbers, the entropy, and the Hamilton energy are conserved; the volume is a 
system size variable which, in the fundamental view, must be taken as infinite (thermody- 
namic limit) to exclude the unavoidable interaction with the environment. However, real 
systems are always in contact with their environment (e.g., stars through radiation, fluids
through the forces exerted by the container), and the conservation laws are approximate
only. In thermodynamics, the description of the system boundary is generally reduced
to the degrees of freedom observable at a given resolution (e.g., a heat bath to maintain
constant temperature, or a potential well to account for the confinement in a container).

The result of this reduced description (for derivations, see, e.g., Balian [7], Grabert
[50], Rau & Müller [105]) is a dynamical effect called dissipation (Thomson [123]),
described by the second law of thermodynamics (Clausius [35]). The Euler inequality
(2) together with the Euler equation (4) only express the nondynamical part of the second
law since, in equilibrium thermodynamics, dynamical questions are ignored: Axiom (ii)
says that if S, V, N are conserved (thermal, mechanical and chemical isolation) then the
internal energy,

$$U := TS - PV + \mu \cdot N \tag{15}$$

is minimal in equilibrium; if T, V, N are conserved (mechanical and chemical isolation of a
system at constant temperature T) then the Helmholtz energy,

$$A := U - TS = -PV + \mu \cdot N$$

is minimal in equilibrium; and if T, P, N are conserved (chemical isolation of a system at
constant temperature T and pressure P) then the Gibbs energy,

$$G := A + PV = \mu \cdot N$$

is minimal in equilibrium.

The third law of thermodynamics (Nernst [92]) says that entropy is nonnegative.
This is a direct consequence of the monotonicity of $\Delta(T, P, \mu)$ and (3).



Consequences of the first law. The first law of thermodynamics describes the observable
energy balance in a reversible process. The total energy flux dH into the system is composed
of the thermal energy flux or heat flux $T\,dS$, the mechanical energy flux $-P\,dV$, and
the chemical energy flux $\mu \cdot dN$.

The Gibbs-Duhem equation (13) describes the energy balance necessary to compensate the
changes $d(TS) = T\,dS + S\,dT$ of thermal energy, $d(PV) = P\,dV + V\,dP$ of mechanical energy,
and $d(\mu \cdot N) = \mu \cdot dN + N \cdot d\mu$ of chemical energy in the energy contributions to the Euler
equation to ensure that the Euler equation remains valid during a reversible transformation.
Indeed, both equations together imply that $d(TS - PV + \mu \cdot N - H)$ vanishes, which
expresses the preservation of the Euler equation.

Related to the various energy fluxes are the thermal work

$$Q = \int T(\lambda)\, dS(\lambda),$$

the mechanical work

$$W_{\mathrm{mech}} = -\int P(\lambda)\, dV(\lambda),$$

and the chemical work

$$W_{\mathrm{chem}} = \int \mu(\lambda) \cdot dN(\lambda)$$

performed in a reversible transformation. The various kinds of work generally depend on
the path through the state space; however, the mechanical work depends only on the end
points if the associated process is conservative.
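That the thermal work generally depends on the path can be seen in a small numerical experiment. The sketch below evaluates $Q = \int T\,dS$ by the trapezoidal rule along two different paths connecting the same two equilibrium states of a single-species ideal gas. It assumes the hypothetical choice $\pi(T) = T^{5/2}$, for which the Euler equation gives $S = N\bar{k}\,(5/2 + \log(V T^{3/2}/(\bar{k}N)))$, reduced units with Boltzmann constant 1, and a fixed particle number; the paths and names are illustrative only.

```python
import math

KBAR = 1.0   # Boltzmann constant in reduced units (assumption)
N = 1.0      # fixed particle number (assumption)
T1, V1, V2 = 2.0, 1.0, 2.0   # common end states: (T1, V1) and (T1, V2)

def entropy(T, V):
    # S(T, V, N) of the ideal gas with the hypothetical pi(T) = T**2.5
    return N * KBAR * (2.5 + math.log(V * T ** 1.5 / (KBAR * N)))

def isothermal(lam):
    # expand from V1 to V2 at constant temperature T1
    return T1, V1 + lam * (V2 - V1)

def isobaric_then_isochoric(lam):
    # first expand at constant pressure (T proportional to V), then cool at fixed V2
    if lam <= 0.5:
        V = V1 + 2 * lam * (V2 - V1)
        return T1 * V / V1, V
    T_hot = T1 * V2 / V1
    return T_hot + (2 * lam - 1) * (T1 - T_hot), V2

def thermal_work(path, steps=20000):
    # trapezoidal approximation of Q = integral of T dS for lam in [0, 1]
    Q = 0.0
    T_prev, V_prev = path(0.0)
    S_prev = entropy(T_prev, V_prev)
    for i in range(1, steps + 1):
        T_next, V_next = path(i / steps)
        S_next = entropy(T_next, V_next)
        Q += 0.5 * (T_prev + T_next) * (S_next - S_prev)
        T_prev, S_prev = T_next, S_next
    return Q

Q_a = thermal_work(isothermal)
Q_b = thermal_work(isobaric_then_isochoric)
print("isothermal path:            ", Q_a)
print("isobaric + isochoric path:  ", Q_b)
```

For these sample values the closed-form results are $N\bar{k}T_1 \log(V_2/V_1)$ for the isothermal path and $N\bar{k}T_1 (V_2/V_1 - 1)$ for the two-step path, so the two numbers differ although the end states coincide.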

As is apparent from the formulas given, thermal work is done by changing the entropy 
of the system, mechanical work by changing the volume, and chemical work by changing 
the particle numbers. In particular, in case of thermal, mechanical, or chemical isolation, 
the corresponding fluxes vanish identically. Thus, constant S characterizes thermally isolated,
adiabatic systems, constant V characterizes mechanically isolated, closed systems,
and constant N characterizes chemically isolated, impermeable systems. (The constancy
depends on all properties of a standard system: global equilibrium, a single phase, and
the absence of chemical reactions.) Of course, these boundary conditions are somewhat
idealized situations, which, however, can be approximately realized in practice and are of
immense scientific and technological importance. (Note that the term 'closed system' also has
a much more general interpretation - which we do not use in this paper - namely as
a conservative dynamical system.)

The first law shows that, in appropriate units, the temperature T is the amount of energy
needed to increase in a mechanically and chemically isolated system the entropy S by one
unit. The pressure P is, in appropriate units, the amount of energy needed to decrease
in a thermally and chemically isolated system the volume V by one unit. (In particular,
increasing pressure decreases the volume; this explains the minus sign in the definition of
P.) The chemical potential $\mu_j$ is, in appropriate units, the amount of energy needed to
increase in a thermally and mechanically isolated system the particle number $N_j$ by one.
(With the traditional units, temperature and pressure are no longer energies.)

We see that the entropy and the volume behave just like the particle number. This analogy 
can be deepened by observing that particle numbers are the natural measure of the amounts 
of "matter" of each kind in a system, and chemical energy flux is accompanied by adding 
or removing matter. Similarly, volume is the natural measure of the amount of "space" 
a system occupies, and mechanical energy flux in a standard system is accompanied by 
adding or removing space. Thus we may regard entropy as the natural measure of the 
amount of "heat" contained in a system², since thermal energy flux is accompanied by

² Thus, entropy is the modern replacement for the historical concepts of phlogiston and caloric, which
failed to give a correct account of heat phenomena. Phlogiston turned out to be 'missing oxygen', an
early analogue of the picture of positrons as holes, or 'missing electrons', in the Dirac sea. Caloric was a
massless substance of heat which had almost the right properties, explained many effects correctly, and fell
out of favor only after it became known that caloric could be generated in arbitrarily large amounts from

adding or removing heat. Looking at other extensive quantities, we also recognize energy
as the natural measure of the amount of "power" , momentum as the natural measure of the 
amount of "force", and mass as the natural measure of the amount of "inertia" of a system. 
In each case, the notions in quotation marks are the colloquial terms which are associated 
in ordinary life with the more precise, formally defined physical quantities. For historical 
reasons, the words heat, power, and force are used in physics with a meaning different from 
the colloquial terms "heat", "power", and "force". 

Consequences of the second law. The second law is centered around the impossibility of 
perpetual motion machines due to the inevitable loss of energy by dissipation such as friction 
(see, e.g., Bowden & Leben |22]), uncontrolled radiation, etc.. This means that - unless 
continually provided from the outside - energy is lost with time until a metastable state 
is attained, which usually is an equilibrium state. Therefore, the energy at equilibrium 
is minimal under the circumstances dictated by the boundary conditions. In a purely 
kinematic setting as in our paper, the approach to equilibrium cannot be studied, and only 
the minimal energy principles - one for each set of boundary conditions - remain. 

Traditionally, the second law is often expressed in the form of an extremal principle for 
some thermodynamic potential. We derive here the extremal principles for the Hamilton 
energy, the Helmholtz energy, and the Gibbs energ}{§, which give rise to the Hamilton 
potential 

U{S, V, N) := max {TS - PV + fi- N \ A(T, P, jj,) = 0;T > 0; P > 0}, 

T,P,ij, 

the Helmholtz potential 

A{T, V, N) := max {-PV + ■ N \ A(T, P, /x) = 0; T > 0; P > 0}, 
p,ti 

and the Gibbs potential 

G{T, P, N) := max {f^ ■ N \ A(T, P, /i) = 0; T > 0; P > 0}. 

The Gibbs potential is of particular importance for everyday processes since the latter 
frequently happen at approximately constant temperature, pressure, and particle number. 
(For other thermodynamic potential used in practice, see Alberty [T]; for the maximum 
entropy principle, see Appendix [Bl) 

mechanical energy, thus discrediting the idea of heat being a substance. (For the precise relation of entropy 
and caloric, see KuHN ^HITH], Walter [128j . and the references quoted there.) In the modern picture, the 
extensivity of entropy models the substance-like properties of heat. But as there are no particles of space 
whose number is proportional to the volume, so there are no particles of heat whose number is proportional 
to the entropy. 

^ The different potentials are related by so-called Legendre transforms; cf. Rockafellar [112j for the 
mathematical properties of Legendre transforms, Arnol'd [B] for their application in in mechanics, and 
Alberty [T] for their application in chemistry. 
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3.2 Theorem. (Extremal principles) 

(i) In an arbitrary state,

$$H \ge U(S,V,N), \tag{16}$$

with equality iff the state is an equilibrium state. The remaining thermodynamic variables
are then given by

$$T = \frac{\partial}{\partial S}U(S,V,N),\quad P = -\frac{\partial}{\partial V}U(S,V,N),\quad \mu = \frac{\partial}{\partial N}U(S,V,N),\quad H = U(S,V,N).$$

In particular, an equilibrium state is uniquely determined by the values of S, V, and N.

(ii) In an arbitrary state,

$$H - TS \ge A(T,V,N), \tag{17}$$

with equality iff the state is an equilibrium state. The remaining thermodynamic variables
are then given by

$$S = -\frac{\partial A}{\partial T}(T,V,N),\quad P = -\frac{\partial A}{\partial V}(T,V,N),\quad \mu = \frac{\partial A}{\partial N}(T,V,N),$$

$$H = A(T,V,N) + TS.$$

In particular, an equilibrium state is uniquely determined by the values of T, V, and N.

(iii) In an arbitrary state,

$$H - TS + PV \ge G(T,P,N), \tag{18}$$

with equality iff the state is an equilibrium state. The remaining thermodynamic variables
are then given by

$$S = -\frac{\partial G}{\partial T}(T,P,N),\quad V = \frac{\partial G}{\partial P}(T,P,N),\quad \mu = \frac{\partial G}{\partial N}(T,P,N),$$

$$H = G(T,P,N) + TS - PV.$$

In particular, an equilibrium state is uniquely determined by the values of T, P, and N.



Proof. We prove (ii); the other two cases are entirely similar. (17) and the statement about
equality are a direct consequence of Axiom 2.1(ii). Thus, the difference $H - TS - A(T,V,N)$
takes its minimum value zero at the equilibrium value of T. Therefore, the derivative with
respect to T vanishes, which gives the formula for S. To get the formulas for P and $\mu$, we
note that for constant T, the first law implies

$$dA = d(H - TS) = dH - T\,dS = -P\,dV + \mu\cdot dN.$$

For the reversible transformation which only changes P or $\mu_j$, we conclude that $dA = -P\,dV$
and $dA = \mu\cdot dN$, respectively. Solving for P and $\mu_j$, respectively, implies the formulas for
P and $\mu_j$. □

The above results imply that one can regard each thermodynamic potential as a complete
alternative way to describe the manifold of thermal states and hence all equilibrium
properties. This is very important in practice, where one usually describes thermodynamic
material properties in terms of the Helmholtz or Gibbs potential, using models like NRTL
(Renon & Prausnitz [M], Prausnitz et al. [103]) or SAFT (Chapman et al. [30, 31]).

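As a hedged numerical illustration of how one potential determines another, the following sketch recovers a Helmholtz potential from a Hamilton potential by the Legendre-type minimization A(T,V,N) = min over S of [U(S,V,N) - TS]. The concrete form of U is an assumption made only for this example (a monatomic ideal gas with the constants `C` and `K` set to 1); it is not taken from the text, and the search interval for S is likewise an arbitrary choice.

```python
import math

K = 1.0  # Boltzmann constant in illustrative units (assumption)
C = 1.0  # material constant of the assumed ideal-gas model (assumption)

def U(S, V, N):
    """Assumed Hamilton potential of a monatomic ideal gas (illustration only)."""
    return C * N ** (5 / 3) * V ** (-2 / 3) * math.exp(2 * S / (3 * N * K))

def helmholtz(T, V, N, lo=-50.0, hi=50.0, iters=200):
    """Return (A, S_min) with A = min over S of U(S,V,N) - T*S.

    U is convex in S, so U - T*S is unimodal and a golden-section
    search on [lo, hi] converges to the minimizer.
    """
    phi = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    for _ in range(iters):
        c = b - phi * (b - a)
        d = a + phi * (b - a)
        if U(c, V, N) - T * c < U(d, V, N) - T * d:
            b = d  # minimum lies in [a, d]
        else:
            a = c  # minimum lies in [c, b]
    S = (a + b) / 2
    return U(S, V, N) - T * S, S

T, V, N = 2.0, 3.0, 1.5
A, S_min = helmholtz(T, V, N)
# At the minimizer, T equals dU/dS = 2U/(3NK) for this model,
# so the equilibrium energy is U = (3/2) N K T.
print(A, S_min, U(S_min, V, N))
```

For this assumed model the minimizer can be written in closed form, which gives an independent check of the numerical search.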
The additivity of extensive quantities is reflected in corresponding properties of the ther- 
modynamic potentials: 

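3.3 Theorem. The potentials U(S,V,N), A(T,V,N), and G(T,P,N) satisfy, for real
$\lambda, \lambda^1, \lambda^2 \ge 0$,

$$U(\lambda S, \lambda V, \lambda N) = \lambda U(S,V,N), \tag{19}$$
$$A(T, \lambda V, \lambda N) = \lambda A(T,V,N), \tag{20}$$
$$G(T, P, \lambda N) = \lambda G(T,P,N), \tag{21}$$
$$U(\lambda^1 S^1 + \lambda^2 S^2, \lambda^1 V^1 + \lambda^2 V^2, \lambda^1 N^1 + \lambda^2 N^2) \le \lambda^1 U(S^1,V^1,N^1) + \lambda^2 U(S^2,V^2,N^2), \tag{22}$$
$$A(T, \lambda^1 V^1 + \lambda^2 V^2, \lambda^1 N^1 + \lambda^2 N^2) \le \lambda^1 A(T,V^1,N^1) + \lambda^2 A(T,V^2,N^2), \tag{23}$$
$$G(T, P, \lambda^1 N^1 + \lambda^2 N^2) \le \lambda^1 G(T,P,N^1) + \lambda^2 G(T,P,N^2). \tag{24}$$

In particular, these potentials are convex in S, V, and N.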

Proof. The first three equations express homogeneity and are a direct consequence of the
definitions. Inequality (23) holds since, for suitable P and $\mu$,

$$A(T, \lambda^1 V^1 + \lambda^2 V^2, \lambda^1 N^1 + \lambda^2 N^2) = -P(\lambda^1 V^1 + \lambda^2 V^2) + \mu\cdot(\lambda^1 N^1 + \lambda^2 N^2)$$
$$= \lambda^1(-PV^1 + \mu\cdot N^1) + \lambda^2(-PV^2 + \mu\cdot N^2)$$
$$\le \lambda^1 A(T,V^1,N^1) + \lambda^2 A(T,V^2,N^2);$$

and the others follow in the same way. Specialized to $\lambda^1 + \lambda^2 = 1$, the inequalities express
the claimed convexity. □

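The homogeneity (19), the convexity (22), and the derivative formulas of Theorem 3.2(i) can be spot-checked numerically. The sketch below does this for an assumed monatomic ideal-gas form of U(S,V,N); this model, the constants `C` and `K`, and the sample states are illustrative assumptions, not part of the text. Derivatives are approximated by central differences.

```python
import math

K = 1.0  # Boltzmann constant in illustrative units (assumption)
C = 1.0  # material constant of the assumed model (assumption)

def U(S, V, N):
    """Assumed Hamilton potential of a monatomic ideal gas (illustration only)."""
    return C * N ** (5 / 3) * V ** (-2 / 3) * math.exp(2 * S / (3 * N * K))

def partial(fun, args, i, h=1e-6):
    """Central finite-difference derivative of fun with respect to args[i]."""
    up, dn = list(args), list(args)
    up[i] += h
    dn[i] -= h
    return (fun(*up) - fun(*dn)) / (2 * h)

state = (2.0, 3.0, 1.5)            # sample values of (S, V, N)
S, V, N = state
T = partial(U, state, 0)           # T =  dU/dS
P = -partial(U, state, 1)          # P = -dU/dV
mu = partial(U, state, 2)          # mu = dU/dN

# Homogeneity (19): scaling all extensive arguments scales U.
lam = 2.5
homogeneity_gap = U(lam * S, lam * V, lam * N) - lam * U(S, V, N)

# Convexity (22) with lambda^1 = lambda^2 = 1/2 for two sample states.
other = (1.0, 5.0, 0.7)
mid = tuple((x + y) / 2 for x, y in zip(state, other))
convexity_gap = (U(*state) + U(*other)) / 2 - U(*mid)   # should be >= 0

print(T, P, mu, homogeneity_gap, convexity_gap)
```

For this model the derivatives reproduce the ideal gas law PV = NkT, and homogeneity of degree one gives the Euler relation U = TS - PV + μN at the sampled state.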


For a system at constant temperature T, pressure P, and particle number N, consisting of
a number of parts labeled by a superscript k which are separately in equilibrium, the Gibbs
energy is extensive,

$$G = H - TS + PV = \sum_k H^k - T\sum_k S^k + P\sum_k V^k = \sum_k (H^k - TS^k + PV^k) = \sum_k G^k.$$

Equilibrium requires that $\sum_k G^k$ is minimal among all choices with $\sum_k N^k = N$, and by
introducing a Lagrange multiplier vector $\mu^*$ for the constraints, we see that in equilibrium,
the derivative of $\sum_k (G(T,P,N^k) - \mu^*\cdot N^k)$ with respect to each $N^k$ must vanish. This
implies that

$$\mu^k = \frac{\partial G}{\partial N^k}(T,P,N^k) = \mu^*.$$

Thus, in equilibrium, all $\mu^k$ must be the same. At constant T, V, and N, one can do
the same argument with the Helmholtz potential, and at constant S, V, and N with the
Hamilton potential. In each case, the equilibrium is characterized by the constancy of the
intensive parameters.

Global equilibrium states are therefore macroscopically highly uniform. Their intrinsic 
complexity is apparent only in a microscopic treatment; the only macroscopic shadow of 
this complexity is the critical opalescence of fluids near a critical point (Andrews [5], 
FORSTER IB]). In particular, the traditional interpretation of entropy as a measure of 
disorder is often misleading: Macroscopic disorder is measured by the degree to which 
macroscopic space and time correlations are absent, not by entropy. In this sense, global 
equilibrium states are the most ordered macroscopic states in the universe rather than the 
most disordered ones. 

A system not in global equilibrium is characterized by macroscopic local inhomogeneities, 
indicating that the space-independent global equilibrium variables alone do not exhaust 
all available macroscopic information. The global equilibrium state is a state without 
additional macroscopic information. The contents of the second law of thermodynamics 
for global equilibrium states may therefore be phrased as follows (cf. Appendix [A]) : At 
global equilibrium, macroscopic order (homogeneity) is perfect and microscopic complexity 
is maximal. 



Using only the present axioms, one can say something about the behavior of a system close
to equilibrium, at least in the following, idealized situation. Suppose that a system at
constant S, V, and N which is close to equilibrium at some time t reaches equilibrium at
some later time $t^*$. Then the second law implies

$$0 \le H(t) - H(t^*) \approx (t - t^*)\frac{dH}{dt},$$

so that $dH/dt \le 0$. Assuming that the system is composed of two parts, both in equilibrium
in the time interval between t and $t^*$, the time shift must be reversible on the parts, and
the first law can be applied to them. Thus

$$dH = \sum_{k=1,2} dH^k = \sum_{k=1,2} (T^k dS^k - P^k dV^k + \mu^k\cdot dN^k).$$

Since S, V, and N remain constant, we have $dS^1 + dS^2 = 0$, $dV^1 + dV^2 = 0$, $dN^1 + dN^2 = 0$,
and since for the time shift $dH \le 0$, we find the inequality

$$0 \ge (T^1 - T^2)\,dS^1 - (P^1 - P^2)\,dV^1 + (\mu^1 - \mu^2)\cdot dN^1.$$

This gives information about the direction of the flow in case that all but one of the extensive
variables are known to be fixed.

^ Much more carefully argued support for this statement, with numerous examples from teaching
practice, is in Lambert [77].

In particular, at constant $V^1$ and $N^1$, we have $dS^1 \le 0$ if $T^1 > T^2$; i.e., "heat" (entropy)
flows from the hotter part towards the colder part. At constant $S^1$ and $N^1$, we have $dV^1 \le 0$
if $P^1 < P^2$; i.e., "space" (volume) flows from lower pressure to higher pressure: the volume
of the lower pressure part decreases and is compensated by a corresponding increase of the
volume in the higher pressure part. And for a single species system at constant $S^1$ and $V^1$,
we have $dN^1 \le 0$ if $\mu^1 > \mu^2$; i.e., "matter" (particle number) flows from higher chemical
potential towards lower chemical potential. This gives temperature, pressure, and chemical
potential the familiar intuitive interpretation.

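The sign rules for a two-part system follow from the single requirement that the energy change at fixed total S, V, and N is nonpositive. The sketch below evaluates that energy change, (T^1 - T^2) dS^1 - (P^1 - P^2) dV^1 + (μ^1 - μ^2) dN^1, for a single species; the function names and the numerical values are hypothetical and chosen only for illustration.

```python
def energy_change(T, P, mu, dS1=0.0, dV1=0.0, dN1=0.0):
    """dH for a two-part system with fixed total S, V, N (single species).

    T, P, mu are pairs (value in part 1, value in part 2);
    dS1, dV1, dN1 are the changes of the extensive variables of part 1.
    """
    return ((T[0] - T[1]) * dS1
            - (P[0] - P[1]) * dV1
            + (mu[0] - mu[1]) * dN1)

def allowed(T, P, mu, **changes):
    """True if the change is compatible with the requirement dH <= 0."""
    return energy_change(T, P, mu, **changes) <= 0

hot_cold = dict(T=(400.0, 300.0), P=(1.0, 1.0), mu=(0.0, 0.0))
print(allowed(dS1=-0.1, **hot_cold))  # entropy leaves the hotter part 1
print(allowed(dS1=+0.1, **hot_cold))  # entropy enters the hotter part 1
```

Only the first of the two printed cases satisfies the inequality, matching the statement that entropy flows from the hotter to the colder part.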
This is a shadow of the far reaching fact that, in nonequilibrium thermodynamics, gradients 
in the intensive variables induce a dissipative dynamics that tends to diminish these gradi- 
ents, thus enforcing (after the time needed to reach equilibrium) agreement of the intensive 
variables of different parts of a system. While these dynamical issues are outside the scope 
of the present work, they motivate the fact that one can control some intensive parameters 
of the system by controlling the corresponding intensive parameters of the environment and 
making the walls permeable to the corresponding extensive quantities. This corresponds 
to standard procedures familiar to everyone from ordinary life: heating to change the tem- 
perature, applying pressure to change the volume, immersion into a substance to change 
the chemical composition; also, in the more general thermal models discussed in Section 9,
applying forces to displace an object, etc.

The stronger nonequilibrium version of the second law says that (for suitable boundary
conditions) equilibrium is attained after some time (strictly speaking, only in the limit of
infinite time). This implies that the energy difference

$$\delta E = H - U(S,V,N) = H - TS - A(T,V,N) = H - TS + PV - G(T,P,N)$$

is the amount of energy that is dissipated in order to reach equilibrium. In an equilibrium
setting, we can only compare what happens to a system prepared in a nonequilibrium state
assuming that, subsequently, the full energy difference $\delta E$ is dissipated so that the system
ends up in an equilibrium state. Since few variables describe everything of interest, this
constitutes the power of equilibrium thermodynamics. But this power is limited, since equi- 
librium thermodynamics is silent about when - or whether at all - equilibrium is reached. 
Indeed, in many cases, only metastable states are reached, which change too slowly to ever 
reach equilibrium on a human time scale. 

The phenomenological description given so far is completely adequate for systems in global 
equilibrium with a single phase only and no chemical reactions. From the formulas provided, 
it is now an easy step to go to various examples and applications. A full discussion of global 
equilibrium would also involve the equilibrium treatment of multiple phases and chemical 
reactions. However, these involve no foundational problems, and their discussion would 
offer little new compared with traditional textbook treatments. Hence they are not treated 
here.

More important is the observation that, when considered in sufficient detail, no physical
system is truly in global equilibrium; one can always find smaller or larger deviations. To 
describe these deviations, extra variables are needed, resulting in a more complete but also 
more complex model. At even higher resolution, this model is again imperfect and an 
approximation to an even more complex, better model. This refinement process may be 
repeated in many, perhaps even infinitely many stages. At the most detailed stages, we 
transcend the frontier of current knowledge in physics, but even as this frontier recedes, 
deeper and deeper stages with unknown details are imaginable. 

Therefore, it is desirable to have a meta-description of thermodynamics that, starting with 
a detailed model, allows one to deduce the properties of each coarser model, in a way that all
description levels are consistent with the current state of the art in physics. Moreover, the 
results should be as independent as possible of unknown details at the lowest levels. 

This will be done in the remainder of the paper. First we look at an anonymous bottom
level, where Gibbs states are the players in the field and define values for arbitrary quan- 
tities. As in this section, the intensive variables determine the state (which now is a more 
abstract object), whereas the extensive variables now appear as values of other abstract 
objects called quantities. This change of setting allows the natural incorporation of quan- 
tum mechanics, where quantities need not commute, while values are numbers observable 
in principle, hence must satisfy the commutative law. 

Then we introduce thermal states by selecting the quantities whose values shall act as 
extensive variables in a thermal model. On this level, we shall be able to reproduce the 
phenomenological setting of the present section from first principles; see the discussion after 
Theorem 8.3. If the underlying detailed model is assumed to be known then the system
function, and with it all thermal properties, are computable in principle, although we only 
hint at the ways to do this numerically. We also look at a hierarchy of thermal models 
based on the same bottom level description and discuss how to decide which description 
levels are appropriate. 



4 Gibbs states 

Any fundamental description of physical systems must give account of the numerical values 
of quantities observable in experiments when the system under consideration is in a specified 
state. Moreover, the form and meaning of states, and what is observable in principle, must 
be clearly defined. For reasons given in Neumaier [93], we avoid using the customary word 
'observable', and consider an axiomatic conceptual foundation on the basis of quantities and 
their values. This is consistent with the conventions adopted by the International System 
of Units (SI) |12U], who declare: "A quantity in the general sense is a property ascribed
to phenomena, bodies, or substances that can be quantified for, or assigned to, a particular
phenomenon, body, or substance. [...] The value of a physical quantity is the quantitative
expression of a particular physical quantity as the product of a number and a unit, the
number being its numerical value." Since quantities can be added and multiplied, the set
of all quantities has an algebraic structure whose properties we now formulate.

The quantities of interest are taken to be the elements of a Euclidean *-algebra $\mathbb{E}$.

The simplest family of Euclidean *-algebras, and the only one with which the reader must
be familiar to understand the paper, is the algebra $\mathbb{E} = \mathbb{C}^{N\times N}$ of square complex N x N
matrices. (This algebra models an N-level quantum system, but also quantum field theory
in the lattice approximation.) Here the quantities are square matrices, the constants are the
multiples of the identity matrix, involution * is conjugate transposition, and the integral $\int$
is the trace, so that all quantities are strongly integrable.

For those who want to understand a little more, we give a formally precise definition. To
say that a set $\mathbb{E}$ of quantities is a Euclidean *-algebra means the following: $\mathbb{E}$ is a complex
vector space containing the complex numbers as constants. Apart from an associative
product (commutative only in the classical case) one has an involution * reducing on the
constants to complex conjugation and a complex-valued integral $\int$ defined on a subspace
of strongly integrable quantities. (The integral can often be naturally extended to a
significantly larger class of integrable quantities.) A partial order $\ge$ is then defined by
$g \ge 0$ iff $g^* = g$ and $\int h^*gh \ge 0$ for all strongly integrable h, and $g \ge h$ iff $g - h \ge 0$.
A quantity g is called Hermitian if $g^* = g$, and bounded if $g^*g \le \alpha^2$ for some $\alpha \in \mathbb{R}$.
Apart from the standard rules for *-algebras (Neumaier [93] discusses in detail what must
be assumed and what can then be proved) and the linearity of the integral, we assume the
following axioms for all bounded quantities g, h and all strongly integrable $h_l$.

(EA1) g bounded, h strongly integrable $\Rightarrow$ $h^*$, $gh$, $hg$ strongly integrable,

(EA2) $(\int h)^* = \int h^*$, $\int gh = \int hg$,

(EA3) $\int h^*h > 0$ if $h \ne 0$,

(EA4) $\int h^*gh = 0$ for all strongly integrable h $\Rightarrow$ $g = 0$ (nondegeneracy),

(EA5) $\int h_l^*h_l \to 0$ $\Rightarrow$ $\int gh_l \to 0$, $\int h_l^*gh_l \to 0$,

(EA6) $h_l \downarrow 0$ $\Rightarrow$ $\inf \int h_l = 0$ (Dini property).

Here, integrals extend over the longest following product or quotient (while later, differential
operators act on the shortest syntactically meaningful term), the monotonic limit is defined
by $g_l \downarrow 0$ iff, for every strongly integrable h, the sequence (or net) $\int h^*g_lh$ consists of
real numbers converging monotonically decreasing to zero. The reader is invited to check
explicitly that (EA1)-(EA6) hold for $\mathbb{E} = \mathbb{C}^{N\times N}$.

Note that (EA3) implies the Cauchy-Schwarz inequality

$$\Big(\int gh\Big)^*\Big(\int gh\Big) \le \int g^*g \int h^*h,$$

which implies that every strongly integrable quantity is bounded. Some results needed in
the following are given in Appendix C, but proved there only in case $\mathbb{E}$ is finite-dimensional.
To justify our treatment rigorously in case the algebra $\mathbb{E}$ is infinite-dimensional, one would
need to use (EA5) and (EA6) and perhaps further, technical assumptions. However, the
main text is fully rigorous if the results of Appendix C are assumed in addition to (EA1)-
(EA6).

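For the matrix algebra, some of these properties are easy to check by direct computation. The following sketch, written with plain Python lists and with arbitrarily chosen sample matrices `g` and `h` (both hypothetical), evaluates the two identities of (EA2), the positivity (EA3), and the Cauchy-Schwarz inequality, taking the integral to be the trace. It is a spot check on two matrices, not a proof.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint(a):
    """Involution *: conjugate transposition."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def integral(a):
    """The integral on the matrix algebra is the trace."""
    return sum(a[i][i] for i in range(len(a)))

# Two arbitrary complex 2 x 2 sample quantities (illustrative values).
g = [[1 + 2j, 0.5j], [3 + 0j, -1 + 1j]]
h = [[2 + 0j, 1 - 1j], [0.25j, 4 + 0j]]

ea2_conj = integral(adjoint(h)) - integral(h).conjugate()       # zero by (EA2)
ea2_cyclic = integral(matmul(g, h)) - integral(matmul(h, g))    # zero by (EA2)
ea3 = integral(matmul(adjoint(h), h))                           # positive by (EA3)
lhs = abs(integral(matmul(g, h))) ** 2
rhs = integral(matmul(adjoint(g), g)).real * ea3.real           # Cauchy-Schwarz bound
print(ea2_conj, ea2_cyclic, ea3, lhs <= rhs)
```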


4.1 Examples. Apart from the matrix algebra, three basic realizations of the axioms are
relevant to nonrelativistic physics. We give here some details to draw the connection with
traditional statistical mechanics. However, the remainder is completely independent of the
details of how our axioms are realized; a specific realization is needed only when doing specific
quantitative calculations. Therefore, the reader lacking the formal mathematical
background to understand some details in these examples may simply skip them.

(i) (Nonrelativistic classical mechanics) An atomic N-particle system is described in
classical mechanics by the phase space $\mathbb{R}^{6N}$ with six coordinates - position in $\mathbb{R}^3$ and
momentum in $\mathbb{R}^3$ - for each particle. The algebra

$$\mathbb{E}_N := C^\infty(\mathbb{R}^{6N})$$

of infinitely differentiable, complex-valued functions $g(x^{1:N}, p^{1:N})$ of positions and momenta
is a Euclidean *-algebra with complex conjugation as involution and the Liouville integral

$$\int g = C^{-1}\int dp^{1:N}\,dx^{1:N}\,g(x^{1:N}, p^{1:N}),$$

where C is a positive constant. Strongly integrable quantities are the Schwartz functions
in $\mathbb{E}_N$. The axioms are easily verified.

(ii) (Classical fluids) A fluid is classically described by an atomic system with an indefinite
number of particles. The appropriate algebra for a single species of monatomic particles
is the direct sum $\mathbb{E} = \bigoplus_{N\ge 0}\mathbb{E}_N$ whose quantities are infinite sequences $g = (g_0, g_1, \dots)$ of
$g_N \in \mathbb{E}_N$, with $\mathbb{E}_N$ as in (i), and weighted Liouville integral

$$\int g = \sum_{N\ge 0} C_N^{-1}\int dp^{1:N}\,dx^{1:N}\,g_N(x^{1:N}, p^{1:N}).$$

Here $C_N$ is a symmetry factor for the symmetry group of the N-particle system, which equals
$h^{3N}N!$ for indistinguishable particles; $h = 2\pi\hbar$ is Planck's constant. This accounts for the
Maxwell statistics and gives the correct entropy of mixing. Classical fluids with monatomic
particles of several different kinds require a tensor product of several such algebras, and
classical fluids composed of molecules require additional degrees of freedom to account for
the rotation and vibration of the molecules.

(iii) (Nonrelativistic quantum mechanics) Let $\mathbb{H}$ be a Euclidean space, a dense
subspace of a Hilbert space. Then the algebra $\mathbb{E} := \mathrm{Lin}\,\mathbb{H}$ of continuous linear operators on $\mathbb{H}$
is a Euclidean *-algebra with the adjoint as involution and the quantum integral

$$\int g = \operatorname{tr} g,$$

given by the trace of the quantity in the integrand. Strongly integrable quantities are the
operators $g \in \mathbb{E}$ for which all $hgh'$ with $h, h' \in \mathbb{E}$ are trace class. (This includes all linear
operators of finite rank.) Again, the axioms are easily verified.

Our next task is to specify the formal properties of the value of a quantity. We assign to
certain quantities g (including all bounded quantities) a value $\langle g\rangle$. Such an assignment
is called a state (of the system under investigation) if it has the following properties (of
which we only use the first three explicitly; the fourth would be needed to justify certain
operations involving differentiation, which we silently assume to be valid):

(R1) $\langle 1\rangle = 1$, $\langle g^*\rangle = \langle g\rangle^*$, $\langle g + h\rangle = \langle g\rangle + \langle h\rangle$,

(R2) $\langle \alpha g\rangle = \alpha\langle g\rangle$ for $\alpha \in \mathbb{C}$,

(R3) If $g \ge 0$ then $\langle g\rangle \ge 0$,

(R4) If $g_l \in \mathbb{E}$, $g_l \downarrow 0$ then $\inf \langle g_l\rangle = 0$.

Note that this formal definition of a state - always used in the remainder of the paper -
differs from the phenomenological version in Section 2. The connection will be made in
Section [HI 

In practice, states are assigned by well-informed and experimentally testable judgment 
concerning one's equipment. Indeed, from a practical point of view, theory defines what an 
object is: A gas is considered to be an ideal gas with certain values of temperature, pressure 
and volume if it behaves to a satisfactory degree like the mathematical model of an ideal 
gas with these parameters, and a solid is considered to be a crystal with certain numerical 
properties if it behaves to a satisfactory degree like the mathematical model of a crystal with 
these properties. In general, we know (or rather assume on the basis of past experience,
claims of manufacturers, etc.) that certain materials or machines reliably produce states 
that (to a satisfactory degree for the purpose of the experiment or application) depend only 
on variables that are accounted for in our theory and that are, to a satisfactory degree, 
either fixed or controllable. The nominal state of a system can be checked and, if necessary, 
corrected by calibration, using appropriate measurements which reveal the parameters 
characterizing the state. 

All states arising in thermodynamics have the following particular form. 
4.2 Definition. A Gibbs state is defined by the values

$$\langle g\rangle := \int e^{-S/k}g, \tag{25}$$

where S, called the entropy of the state, is a Hermitian quantity with strongly integrable
$e^{-S/k}$ satisfying the normalization condition

$$\int e^{-S/k} = 1, \tag{26}$$

and k is the Boltzmann constant (8). The special case f = S/k, Z = 1 of Theorem 4.3
below implies that a Gibbs state is indeed a state.

^ cf. Callen [26, p. 15]: "Operationally, a system is in an equilibrium state if its properties are
consistently described by thermodynamic theory."

What is traditionally (and in Section 2) called entropy and denoted by S is in the present
setting the value $\overline{S} = \langle S\rangle$. What is here called entropy has in the literature on statistical
mechanics a variety of alternative names. For example, Gibbs |1H] (who first noticed the rich
thermodynamic implications of states defined by (25)) called -S the index of probability;
Alhassid & Levine [3] and Balian [8] use the name surprisal for S. Our terminology is
close to that of Mrugala et al. [89], who call S the microscopic entropy, and Hassan et
al. [58], who call S the information(al) entropy operator.

4.3 Theorem.

(i) A Gibbs state determines its entropy uniquely.

(ii) For any Hermitian quantity f with strongly integrable $e^{-f}$, the mapping $\langle\cdot\rangle_f$ defined by

$$\langle g\rangle_f := Z^{-1}\int e^{-f}g, \quad Z := \int e^{-f} \tag{27}$$

is a state. It is a Gibbs state with entropy

$$S := k(f + \log Z). \tag{28}$$

(iii) The KMS condition (cf. Kubo JT^, Martin & Schwinger [83])

$$\langle gh\rangle_f = \langle hQ_fg\rangle_f \quad \text{for bounded } g, h \tag{29}$$

holds. Here $Q_f$ is the linear mapping defined by

$$Q_fg := e^{-f}ge^{f}.$$

Proof. (i) If the entropies S and S' define the same Gibbs state then

$$\int (e^{-S/k} - e^{-S'/k})g = \langle g\rangle - \langle g\rangle = 0$$

for all g, hence $e^{-S/k} - e^{-S'/k} = 0$. This implies that $e^{-S/k} = e^{-S'/k}$, hence S = S' by
Proposition C.1.

(ii) The quantity $d := e^{-f/2}$ is nonzero and satisfies $d^* = d$, $e^{-f} = d^*d \ge 0$. Hence Z > 0
by (EA3), and $\rho := Z^{-1}e^{-f}$ is Hermitian and nonnegative. For $h \ge 0$, the quantity $g = \sqrt{h}$
is Hermitian (by Proposition C.1) and satisfies $g\rho g^* = Z^{-1}(gd)(gd)^* \ge 0$, hence by (EA2),

$$\langle h\rangle_f = \langle g^*g\rangle_f = \int \rho g^*g = \int g\rho g^* \ge 0.$$

Moreover, $\langle 1\rangle_f = Z^{-1}\int e^{-f} = 1$. Similarly, if $g \ge 0$ then $g = h^*h$ with $h = \sqrt{g} = h^*$ and
with $k := e^{-f/2}h$, we get

$$Z\langle g\rangle_f = \int e^{-f}hh^* = \int h^*e^{-f}h = \int k^*k \ge 0.$$

This implies (R3). The other axioms (R1)-(R4) follow easily from the corresponding
properties of the integral. Thus $\langle\cdot\rangle_f$ is a state. Finally, with the definition (28), we have

$$Z^{-1}e^{-f} = e^{-f - \log Z} = e^{-S/k},$$

whence $\langle\cdot\rangle_f$ is a Gibbs state.

(iii) By (EA2), $Z\langle hQ_fg\rangle_f = \int e^{-f}hQ_fg = \int Q_fg\,e^{-f}h = \int e^{-f}gh = Z\langle gh\rangle_f$. □

Note that the state (27) is unaltered when f is shifted by a constant. $Q_f$ is called the
modular automorphism of the state $\langle\cdot\rangle_f$ since $Q_f(gh) = Q_f(g)Q_f(h)$; for a classical
system, $Q_f$ is the identity. In the following, we shall not make use of the KMS condition;
however, it plays an important role in the mathematics of the thermodynamic limit (cf.
Thirring [122]).

We call Z the partition function of f; it is a function of whatever parameters appear
in a particular form given to f in the applications. A large part of traditional statistical
mechanics is concerned with the calculation of the partition function Z and of special values
when f is given. As we shall see, much of the qualitative theory of statistical mechanics
is completely independent of the details involved, and it is this part that we concentrate
upon in this paper.

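In the simplest commutative case the formulas (27) and (28) can be evaluated directly. The sketch below assumes the algebra of diagonal N x N matrices, represented by Python lists of their diagonal entries, with the trace (here a plain sum) as integral; the function names, the sample values of f, and the choice k = 1 are illustrative assumptions. It checks the normalization (26) and the invariance of the state under a constant shift of f.

```python
import math

def gibbs_state(f):
    """Return (Z, value) for diagonal f, with value(g) = Z^-1 * sum(e^{-f} g)."""
    weights = [math.exp(-fi) for fi in f]
    Z = sum(weights)

    def value(g):
        return sum(w * gi for w, gi in zip(weights, g)) / Z

    return Z, value

def entropy(f, k=1.0):
    """Entropy quantity S = k (f + log Z) of the state defined by f, cf. (28)."""
    Z, _ = gibbs_state(f)
    return [k * (fi + math.log(Z)) for fi in f]

f = [0.0, 0.7, 1.5]              # sample diagonal entries (illustrative)
Z, value = gibbs_state(f)
S = entropy(f)

normalization = sum(math.exp(-s) for s in S)     # equals 1 by (26), with k = 1
_, shifted = gibbs_state([fi + 3.0 for fi in f])
g = [2.0, -1.0, 0.5]
print(normalization, value(g), shifted(g), value(S))
```

In this commutative setting the value of the entropy quantity coincides with the familiar expression -k Σ p ln p for the probabilities p = e^{-f}/Z.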
4.4 Example. A canonical ensemble is defined as a Gibbs state whose entropy is an
affine function of a Hermitian quantity H, called the Hamiltonian:

$$S = \beta H + \text{const},$$

with a constant depending on $\beta$, computable from (28) and the partition function

$$Z = \int e^{-\beta H}$$

of $f = \beta H$. In particular, if S is quantized then, in order that Z is finite, S and hence H
must have a discrete spectrum that is bounded below, and the partition function takes the
familiar form

$$Z = \operatorname{tr} e^{-\beta H} = \sum_{n\in\mathcal{N}} e^{-\beta E_n}, \tag{30}$$

where the $E_n$ ($n \in \mathcal{N}$) are the energy levels, the eigenvalues of H. If the spectrum of H
is known, this leads to explicit formulas for Z. For example, a two level system is defined
by the energy levels 0, E (or $E_0$ and $E_0 + E$, which gives the same results), and has

$$Z = 1 + e^{-\beta E}. \tag{31}$$

It describes a single Fermion mode, but also many other systems at low temperature; cf.
( I1U9I). In particular, it is the basis of laser-induced chemical reactions in photochemistry
(see, e.g., Karlov [BB], Murov et al. [SI]), where only two molecular energy levels are
activated.

^ This traditional terminology appears to be unalterably fixed. We shall therefore use the term ensemble
interchangeably with state. However, we want to stress that, in the present setting, an ensemble may consist
of a single system only, rather than - as in the traditional statistical interpretation - of a large collection
of identically prepared systems. The latter interpretation has well-known difficulties to explain why each
single macroscopic system is described correctly by thermodynamics.

For a harmonic oscillator, defined by the energy levels nE, n = 0, 1, 2, ... and describing
a single Boson mode, we have

$$Z = \sum_{n=0}^{\infty} e^{-n\beta E} = (1 - e^{-\beta E})^{-1}.$$

Independent modes are modelled by taking tensor products of single mode algebras and 
adding their Hamiltonians, leading to spectra which are obtained by summing the eigenval- 
ues of the modes in all possible ways. The resulting partition function is the product of the 
single-mode partition functions. From here, a thermodynamic limit leads to the properties 
of ideal gases. Then nonideal gases due to interactions can be handled using the cumulant 
expansion, as indicated at the end of this section. The details are outside the scope of this 
paper. 

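The partition functions of this example are simple enough to evaluate directly. The following sketch computes Z from a list of energy levels as in (30), compares it with the two-level formula (31) and with the geometric-series closed form for the oscillator, and checks the product rule for independent modes. All function names and the sample values of β and E are hypothetical choices for illustration; the oscillator sum is truncated at a finite number of levels.

```python
import math
from itertools import product

def z_from_levels(beta, levels):
    """Partition function (30): sum of exp(-beta * E_n) over the given levels."""
    return sum(math.exp(-beta * En) for En in levels)

def z_two_level(beta, E):
    """Two-level system with energy levels 0 and E, cf. (31)."""
    return 1.0 + math.exp(-beta * E)

def z_oscillator(beta, E):
    """Closed form of sum_{n>=0} exp(-n*beta*E), valid for beta*E > 0."""
    return 1.0 / (1.0 - math.exp(-beta * E))

def combined_levels(*spectra):
    """Levels of independent modes: all sums of one level from each mode."""
    return [sum(combo) for combo in product(*spectra)]

beta, E = 0.8, 1.3
truncated = z_from_levels(beta, [n * E for n in range(200)])
two_modes = z_from_levels(beta, combined_levels([0.0, 1.3], [0.0, 0.4]))
print(truncated, z_oscillator(beta, E))
print(two_modes, z_two_level(beta, 1.3) * z_two_level(beta, 0.4))
```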
Since the Hamiltonian can be any Hermitian quantity, the quantum partition function
formula (30) can in principle be used to compute the partition function of arbitrary quantized
Hermitian quantities.



5 Kubo product and generating functional 

The negative logarithm of the partition function, the so-called generating functional, plays 
a fundamental role in the foundation of thermodynamics. 

We first discuss a number of general properties, discovered by Gibbs [18], Peierls [98],
Bogoliubov [19], Kubo, Mori [8^, and Griffiths [53]. The somewhat technical
setting involving the Kubo inner product is necessary to handle noncommuting quantities 
correctly; everything would be much easier in the classical case. 

But the reader can be assured that (together with Appendix E) this section is the only
technical part of the paper, and the effort in understanding what goes on is more than 
compensated for by the ease with which everything else follows. On a first reading, the 
proofs in this section may simply be skipped. 

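5.1 Proposition. Let f be Hermitian such that $e^{sf}$ is strongly integrable for all $s \in [-1, 1]$.
Then

$$\langle g;h\rangle_f := \langle gE_fh\rangle_f, \tag{32}$$

where $E_f$ is the linear mapping defined for Hermitian f by

$$E_fg := \int_0^1 ds\,e^{-sf}ge^{sf},$$

defines a bilinear, positive definite inner product $\langle\cdot\,;\cdot\rangle_f$ on the algebra of quantities, called
the Kubo (or Mori or Bogoliubov) inner product. For all f, g, the following relations
hold:

$$\langle g;h\rangle_f^* = \langle h^*;g^*\rangle_f. \tag{33}$$
$$\langle g^*;g\rangle_f > 0 \quad \text{if } g \ne 0. \tag{34}$$
$$\langle g;h\rangle_f = g\langle h\rangle_f \quad \text{if } g \in \mathbb{C}, \tag{35}$$
$$\langle g;h\rangle_f = \langle gh\rangle_f \quad \text{if } g \text{ or } h \text{ commutes with } f, \tag{36}$$
$$E_fg = g \quad \text{if } g \text{ commutes with } f. \tag{37}$$

If $f = f(\lambda)$ depends continuously differentiably on the real parameter vector $\lambda$ then

$$\frac{d}{d\lambda}e^{-f} = -\Big(E_f\frac{df}{d\lambda}\Big)e^{-f}. \tag{38}$$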
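Proof. (i) We have

$$\langle g;h\rangle_f^* = \langle (gE_fh)^*\rangle_f = \langle (E_fh)^*g^*\rangle_f = \Big\langle \int_0^1 ds\,e^{sf}h^*e^{-sf}g^*\Big\rangle_f = \int_0^1 ds\,\langle e^{sf}h^*e^{-sf}g^*\rangle_f.$$

The integrand equals

$$Z^{-1}\int e^{-f}e^{sf}h^*e^{-sf}g^* = Z^{-1}\int e^{sf}e^{-f}h^*e^{-sf}g^* = Z^{-1}\int e^{-f}h^*e^{-sf}g^*e^{sf} = \langle h^*e^{-sf}g^*e^{sf}\rangle_f$$

by (EA2), hence

$$\langle g;h\rangle_f^* = \int_0^1 ds\,\langle h^*e^{-sf}g^*e^{sf}\rangle_f = \Big\langle h^*\int_0^1 ds\,e^{-sf}g^*e^{sf}\Big\rangle_f = \langle h^*E_fg^*\rangle_f = \langle h^*;g^*\rangle_f.$$

Thus (33) holds.

(ii) Suppose that $g \ne 0$. For $s \in [0, 1]$, we define u = s/2, v = (1 - s)/2 and $g(s) := e^{-uf}ge^{-vf}$.
Since f is Hermitian, $g(s)^* = e^{-vf}g^*e^{-uf}$, hence by (EA2) and (EA3),

$$\int e^{-f}g^*e^{-sf}ge^{sf} = \int e^{-uf}ge^{-2vf}g^*e^{-uf} = \int g(s)g(s)^* = \int g(s)^*g(s) > 0,$$

so that

$$\langle g^*;g\rangle_f = \langle g^*E_fg\rangle_f = Z^{-1}\int_0^1 ds\int e^{-f}g^*e^{-sf}ge^{sf} > 0.$$

This proves (34) and shows that the Kubo inner product is positive definite.

(iii) If f and g commute then $ge^{sf} = e^{sf}g$, hence

$$E_fg = \int_0^1 ds\,e^{-sf}e^{sf}g = \int_0^1 ds\,g = g,$$

giving (37). The definition of the Kubo inner product then implies (36), and taking $g \in \mathbb{C}$
gives (35).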

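(iv) The function q on [0, 1] defined by

$$q(t) := \int_0^t ds\,e^{-sf}\frac{df}{d\lambda}e^{sf} + \Big(\frac{d}{d\lambda}e^{-tf}\Big)e^{tf}$$

satisfies q(0) = 0 and

$$\frac{d}{dt}q(t) = e^{-tf}\frac{df}{d\lambda}e^{tf} + \Big(\frac{d}{d\lambda}e^{-tf}\Big)fe^{tf} - \Big(\frac{d}{d\lambda}\big(e^{-tf}f\big)\Big)e^{tf} = 0.$$

Hence q vanishes identically. In particular, q(1) = 0, giving (38). □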

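For matrices, the Kubo inner product can be computed explicitly when f is diagonal, since then the (i, j) entry of $e^{-sf}ge^{sf}$ is $g_{ij}e^{s(f_j - f_i)}$ and the s-integral is elementary. The sketch below uses this to spot-check (33), (34), (36), and (37) on 2 x 2 sample matrices. Working in the eigenbasis of f, the helper names, and the sample values are all assumptions made for this illustration.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint(a):
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def value(f, g):
    """<g>_f = Z^-1 tr(e^{-f} g) for f = diag(f[0], ..., f[n-1])."""
    w = [math.exp(-fi) for fi in f]
    return sum(w[i] * g[i][i] for i in range(len(f))) / sum(w)

def kubo_map(f, g):
    """E_f g = int_0^1 ds e^{-sf} g e^{sf}, entrywise for diagonal f."""
    n = len(f)
    out = [[0j] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d = f[j] - f[i]
            # int_0^1 exp(s*d) ds = (exp(d) - 1)/d, with value 1 for d = 0
            weight = math.expm1(d) / d if d != 0 else 1.0
            out[i][j] = g[i][j] * weight
    return out

def kubo(f, g, h):
    """Kubo inner product <g;h>_f = <g E_f h>_f."""
    return value(f, matmul(g, kubo_map(f, h)))

f = [0.3, 1.1]
g = [[0.5 + 0j, 0.2 + 0.4j], [1.0 - 0.3j, -0.7 + 0j]]
h = [[1.0 + 0j, -0.6j], [0.25 + 0j, 2.0 + 0.5j]]
diag = [[1.5 + 0j, 0j], [0j, -0.4 + 0j]]   # commutes with f

positivity = kubo(f, adjoint(g), g)                                           # (34)
symmetry_gap = kubo(f, g, h).conjugate() - kubo(f, adjoint(h), adjoint(g))    # (33)
commuting_gap = kubo(f, g, diag) - value(f, matmul(g, diag))                  # (36)
print(positivity, symmetry_gap, commuting_gap)
```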


As customary in thermodynamics, we use differentials to express relations involving the
differentiation by arbitrary parameters. To write (38) in differential form, we formally
multiply by $d\lambda$, and obtain the quantum chain rule for exponentials,

$$de^{-f} = (-E_fdf)e^{-f}. \tag{39}$$

If the $f(\lambda)$ commute for all values of $\lambda$ then the quantum chain rule reduces to the classical
chain rule. Indeed, then f commutes also with $df/d\lambda$, hence $E_f\,df/d\lambda = df/d\lambda$ and $E_fdf = df$.

The following theorem is central to the mathematics of thermodynamics. As will be apparent 
from the subsequent discussion, part (i) is the abstract mathematical form of the second 
law of thermodynamics, part (iii) is the abstract form of the first law, and part (ii) allows 
the actual computation of thermal properties from microscopic assumptions. 

5.2 Theorem. Let $f$ be Hermitian such that $e^{sf}$ is strongly integrable for all $s \in [-1,1]$.
(i) The generating functional

$$W(f) := -\log\int e^{-f} \tag{40}$$

is a concave function of the Hermitian quantity $f$. In particular,

$$W(g) \le W(f) + \langle g-f\rangle_f. \quad\text{(Gibbs-Bogoliubov inequality)} \tag{41}$$

Equality holds in (41) iff $f$ and $g$ differ by a constant.

(ii) For Hermitian $g$, the cumulant expansion

$$W(f+\tau g) = W(f) + \tau\langle g\rangle_f + \frac{\tau^2}{2}\big(\langle g\rangle_f^2 - \langle g;g\rangle_f\big) + O(\tau^3) \tag{42}$$

holds if the coefficients are finite.

(iii) If $f = f(\lambda)$ and $g = g(\lambda)$ depend continuously differentiably on $\lambda$ then the following
differentiation formulas hold:

$$d\langle g\rangle_f = \langle dg\rangle_f - \langle g;df\rangle_f + \langle g\rangle_f\langle df\rangle_f, \tag{43}$$

$$dW(f) = \langle df\rangle_f. \tag{44}$$

(iv) The entropy of the state $\langle\cdot\rangle_f$ is

$$S = \bar k\,(f - W(f)). \tag{45}$$
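Proof. We prove the assertions in reverse order.

(iv) Equation (40) says that $W(f) = -\log Z$, which together with (28) gives (45).

(iii) We have

$$d\int ge^{-f} = \int dg\,e^{-f} + \int g\,de^{-f} = \int dg\,e^{-f} - \int gE_fdf\,e^{-f} = \int (dg - gE_fdf)e^{-f} = Z\langle dg - gE_fdf\rangle_f.$$

On the other hand, $d\int ge^{-f} = d(Z\langle g\rangle_f) = dZ\,\langle g\rangle_f + Z\,d\langle g\rangle_f$, so that

$$dZ\,\langle g\rangle_f + Z\,d\langle g\rangle_f = Z\langle dg - gE_fdf\rangle_f = Z\langle dg\rangle_f - Z\langle g;df\rangle_f. \tag{46}$$

In particular, for $g = 1$ we find by (35) that $dZ = -Z\langle 1;df\rangle_f = -Z\langle df\rangle_f$. Now (44) follows
from $dW(f) = -d\log Z = -dZ/Z = \langle df\rangle_f$, and solving for $d\langle g\rangle_f$ gives (43).

(ii) We introduce the function $\phi$ defined by

$$\phi(\tau) := W(f+\tau g).$$

From (44), we find $\phi'(\tau) = \langle g\rangle_{f+\tau g}$ for $f,g$ independent of $\tau$, and by differentiating this
again, using (43),

$$\phi''(\tau) = \frac{d}{d\tau}\langle g\rangle_{f+\tau g} = \langle g\rangle_{f+\tau g}^2 - \langle g;g\rangle_{f+\tau g}.$$

In particular,

$$\phi'(0) = \langle g\rangle_f, \quad \phi''(0) = \langle g\rangle_f^2 - \langle gE_fg\rangle_f = \langle g\rangle_f^2 - \langle g;g\rangle_f. \tag{47}$$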
A Taylor expansion now implies (42).

(i) Since the Cauchy-Schwarz inequality for the Kubo inner product implies

$$\langle g\rangle_f^2 = \langle g;1\rangle_f^2 \le \langle g;g\rangle_f\langle 1;1\rangle_f = \langle g;g\rangle_f,$$

we see that

$$\frac{d^2}{d\tau^2}W(f+\tau g)\Big|_{\tau=0} \le 0$$

for all $f,g$. This implies that $W(f)$ is concave. Moreover, replacing $f$ by $f+sg$, we find
that $\phi''(s) \le 0$ for all $s$. The remainder form of Taylor's theorem therefore gives

$$\phi(\tau) = \phi(0) + \tau\phi'(0) + \int_0^\tau ds\,(\tau-s)\phi''(s) \le \phi(0) + \tau\phi'(0),$$

and for $\tau = 1$ we get

$$W(f+g) \le W(f) + \langle g\rangle_f. \tag{48}$$

(41) follows upon replacing $g$ by $g-f$.

By the derivation, equality holds in (48) only if $\phi''(s) = 0$ for all $0 < s < 1$. By (47),
applied with $f+sg$ in place of $f$, this implies $\langle g\rangle_{f+sg}^2 = \langle g;g\rangle_{f+sg}$. Thus we have equality
in the Cauchy-Schwarz argument, forcing $g$ to be a multiple of 1. Therefore equality in the
Gibbs-Bogoliubov inequality (41) is possible only if $g-f$ is a constant. □
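The Gibbs-Bogoliubov inequality (41) and the cumulant expansion (42) can be checked numerically in the simplest commutative setting. The sketch below assumes a classical model with four states, where quantities are lists of real numbers and the integral is the sum over states; in this case the Kubo inner product $\langle g;g\rangle_f$ reduces to $\langle g^2\rangle_f$ by (36). The numerical values of `f` and `g` are arbitrary illustrations, not taken from the text.

```python
import math

def W(f):
    """Generating functional W(f) = -log sum_i exp(-f_i) for a finite classical model."""
    return -math.log(sum(math.exp(-x) for x in f))

def value(f, g):
    """Normalized value <g>_f = sum_i exp(-f_i) g_i / sum_i exp(-f_i)."""
    weights = [math.exp(-x) for x in f]
    return sum(w * gi for w, gi in zip(weights, g)) / sum(weights)

f = [0.0, 0.7, 1.9, 3.2]      # hypothetical reference quantity
g = [1.0, -0.5, 2.0, 0.3]     # hypothetical second quantity

# Gibbs-Bogoliubov inequality (41): W(g) <= W(f) + <g - f>_f
difference = [gi - fi for gi, fi in zip(g, f)]
gap = W(f) + value(f, difference) - W(g)

def second_order(tau):
    """Right-hand side of (42) without the O(tau^3) term."""
    mean = value(f, g)
    kubo = value(f, [gi * gi for gi in g])   # <g;g>_f = <g^2>_f in the commutative case
    return W(f) + tau * mean + 0.5 * tau ** 2 * (mean ** 2 - kubo)

def exact(tau):
    return W([fi + tau * gi for fi, gi in zip(f, g)])

err_small = abs(exact(0.01) - second_order(0.01))
err_large = abs(exact(0.02) - second_order(0.02))
ratio = err_large / err_small    # close to 2**3 if the remainder is O(tau^3)

print(gap, err_small, ratio)
```

Since `g - f` is not constant here, `gap` should be strictly positive, and doubling $\tau$ should multiply the truncation error by roughly eight.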



5.3 Theorem. Let $S_c$ be the entropy of a reference state. Then, for an arbitrary Gibbs
state $\langle\cdot\rangle$ with entropy $S$,

$$\langle S\rangle \le \langle S_c\rangle, \tag{49}$$

with equality only if $S_c = S$.

Proof. Let $f = S/\bar k$ and $g = S_c/\bar k$. Since $S$ and $S_c$ are entropies, $W(f) = W(g) = 0$, and
the Gibbs-Bogoliubov inequality (41) gives $0 \le \langle g-f\rangle_f = \langle S_c - S\rangle/\bar k$. This implies (49). If
equality holds then equality holds in (41), so that $S_c$ and $S$ differ only by a constant. But
this constant vanishes since the values agree. □



The difference

$$\langle S_c - S\rangle = \langle S_c\rangle - \langle S\rangle \ge 0 \tag{50}$$

is known as relative entropy. It can be interpreted as the amount of information in a
state $\langle\cdot\rangle$ which cannot be explained by the reference state. This interpretation makes sense
since the relative entropy vanishes precisely for the reference state. A large relative entropy 
therefore indicates that the state contains some important information not present in the 
reference state. 
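As an illustration of (50), the relative entropy can be evaluated directly for a classical model with finitely many states. The sketch assumes that the integral is the sum over states, that the Boltzmann constant is set to 1, and that a Gibbs state with entropy $S$ assigns the weight $e^{-S_i/\bar k}$ to state $i$; the weights below are hypothetical.

```python
import math

K_BAR = 1.0   # Boltzmann constant set to 1 (assumption of this sketch)

def entropy_quantity(p):
    """Entropy S with S_i = -K_BAR * log p_i, so that exp(-S_i / K_BAR) = p_i."""
    return [-K_BAR * math.log(pi) for pi in p]

def value(p, g):
    """Value <g> in the Gibbs state with weights p."""
    return sum(pi * gi for pi, gi in zip(p, g))

def relative_entropy(p, p_ref):
    """<S_c - S> evaluated in the state p, with reference weights p_ref."""
    s, s_ref = entropy_quantity(p), entropy_quantity(p_ref)
    return value(p, [b - a for a, b in zip(s, s_ref)])

state = [0.5, 0.3, 0.2]            # hypothetical Gibbs state
reference = [1 / 3, 1 / 3, 1 / 3]  # hypothetical reference state

gap = relative_entropy(state, reference)        # positive, as in (50)
zero = relative_entropy(reference, reference)   # vanishes for the reference state

print(gap, zero)
```

In this commutative setting the relative entropy coincides with what statisticians call the Kullback-Leibler divergence of the state from the reference state.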



Approximations. The cumulant expansion is the basis of a well-known approximation
method in statistical mechanics. Starting from special reference states $\langle\cdot\rangle_f$ with explicitly
known $W(f)$ and $E_f$ (corresponding to so-called explicitly solvable models), one obtains
inductively expressions for values in these states by applying the differentiation rules. (In
the most important cases, the resulting formulas for the values are commonly referred to
as a Wick theorem, cf. Wick [131]. For details, see textbooks on statistical mechanics,
e.g., Huang [61], Reichl [106].)

From these, one can calculate the coefficients in the cumulant expansion; note that higher
order terms can be found by proceeding as in the proof, using further differentiation. This
gives approximate generating functions (and by differentiation associated values) for Gibbs
states with an entropy close to the explicitly solvable reference state. From the resulting
generating function and the differentiation formulas (43)-(44), one gets as before the values
for the given state.

The best reference state $\langle\cdot\rangle_f$ to use for a given Gibbs state $\langle\cdot\rangle_g$ can be obtained by minimizing
the upper bound in the Gibbs-Bogoliubov inequality (41) over the $f$ for which an explicit
generating function is known. Frequently, one simply approximates $W(g)$ by the minimum
of this upper bound,

$$W(g) \approx W_m(g) := \inf_f \big(W(f) + \langle g-f\rangle_f\big). \tag{51}$$

Using $W_m(g)$ in place of $W(g)$ defines a so-called mean field theory; cf. Callen [26]. For
computations from first principles (quantum field theory), see, e.g., the survey by Berges
et al. [15].
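A minimal numerical sketch of this variational procedure, under assumptions not made in the text: the model consists of two classical spins $s_1, s_2 \in \{-1,1\}$, the integral is the sum over the four configurations, $g = -J s_1 s_2 - h(s_1+s_2)$ with hypothetical constants $J$ and $h$, and the explicitly solvable reference family is $f_b = -b(s_1+s_2)$ (independent spins). The upper bound $W(f_b) + \langle g - f_b\rangle_{f_b}$ from (41) is minimized over $b$ by a simple grid search.

```python
import itertools
import math

STATES = list(itertools.product((-1, 1), repeat=2))   # configurations of two spins

def W(f):
    """Generating functional W(f) = -log sum over states of exp(-f(s))."""
    return -math.log(sum(math.exp(-f(s)) for s in STATES))

def value(f, g):
    """Normalized value <g>_f over the finite state set."""
    weights = [math.exp(-f(s)) for s in STATES]
    return sum(w * g(s) for w, s in zip(weights, STATES)) / sum(weights)

J, FIELD = 0.5, 0.2   # hypothetical coupling and external field

def g(s):
    return -J * s[0] * s[1] - FIELD * (s[0] + s[1])

def reference(b):
    """Solvable reference quantity f_b for independent spins."""
    return lambda s: -b * (s[0] + s[1])

def upper_bound(b):
    """Right-hand side of the Gibbs-Bogoliubov inequality (41) with f = f_b."""
    f = reference(b)
    return W(f) + value(f, lambda s: g(s) - f(s))

grid = [i / 1000.0 for i in range(-3000, 3001)]
b_best = min(grid, key=upper_bound)
W_mean_field = upper_bound(b_best)   # minimum of the upper bound over the family
W_exact = W(g)

print(b_best, W_mean_field, W_exact)
```

For this family the minimizer should satisfy, up to the grid resolution, the familiar self-consistency condition $b = J\tanh b + h$, and the mean field value should lie above the exact $W(g)$.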



6 Limit resolution and statistics 

In statistical mechanics, our values of quantities are called expectations, and refer to the
mean over an ensemble of (real or imagined) identically prepared systems. While we keep
the notation with pointed brackets familiar from statistical mechanics, we want to avoid
any probabilistic connotation, hence use our more neutral term value, which does not require a
statistical interpretation.

We now discuss in which sense the interpretation of the values of quantities as objective, 
observer-independent properties is valid. The key is an analysis of the uncertainty inherent 
in the description of a system by a state, based on the following result. (Cf. Neumaier 
[93] for a preliminary version of this interpretation.) 

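6.1 Proposition. For Hermitian $g$,

$$\langle g\rangle^2 \le \langle g^2\rangle. \tag{52}$$

Equality holds if $g = \langle g\rangle$.

Proof. Put $\bar g = \langle g\rangle$. Then $0 \le \langle (g-\bar g)^2\rangle = \langle g^2 - 2\bar g g + \bar g^2\rangle = \langle g^2\rangle - 2\bar g\langle g\rangle + \bar g^2 = \langle g^2\rangle - \langle g\rangle^2$.
This gives (52). If $g = \bar g$ then equality holds in this argument. □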



As a consequence, we can define the limit resolution

$$\mathrm{Res}(g) := \sqrt{\langle g^2\rangle/\langle g\rangle^2 - 1} \tag{53}$$

of a Hermitian quantity $g$ with nonzero value $\langle g\rangle$, an uncertainty measure specifying how
accurately one can treat $g$ as a sharp number, given by this value.

In experimental practice, the limit resolution is a lower bound on the relative accuracy with
which one can expect $\langle g\rangle$ to be determinable reliably^ from measurements of a single system
at a single time. In particular, a quantity $g$ is considered to be significant if $\mathrm{Res}(g) \ll 1$,
while it is noise if $\mathrm{Res}(g) \gg 1$. If $g$ is a quantity and $\tilde g$ is a good approximation of its value
then $\varepsilon := g - \tilde g$ is noise.

For a single system at a single time, values $\langle g\rangle$ have in intermediate cases, where $\mathrm{Res}(g)$
has the order of unity, no experimentally testable meaning. However, time averages of $\bar g(t_i)$
at different times $t_i$ may have a meaning - they are the values of the quantity defined as
the average of the $g(t_i)$, and by the law of large numbers, this quantity may be significant
even if each single $g(t_i)$ is not.

Thus, this terminology captures correctly the experimental practice, without imposing any 
statistical or probabilistic connotations. On the contrary, it determines the precise condi- 
tions under which statistical reasoning is justified. 
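The distinction between significant quantities and noise, and the effect of averaging, can be illustrated with a small computation. The sketch below assumes a classical state given by finitely many weights, evaluates $\mathrm{Res}(g) = \sqrt{\langle g^2\rangle/\langle g\rangle^2 - 1}$, and models the average of $n$ quantities $g(t_i)$ by $n$ independent copies of the same state; independence is an assumption of the sketch, not a statement of the text. All numbers are hypothetical.

```python
import itertools
import math

def value(p, g):
    """Value <g> in the classical state with weights p."""
    return sum(pi * gi for pi, gi in zip(p, g))

def res(p, g):
    """Limit resolution: standard deviation divided by the absolute value of <g>."""
    mean = value(p, g)
    second = value(p, [x * x for x in g])
    return math.sqrt(max(0.0, second / mean ** 2 - 1.0))

def res_of_average(p, g, n):
    """Res of the average of n independent copies, by enumerating the product state."""
    states = list(itertools.product(range(len(p)), repeat=n))
    weights = [math.prod(p[i] for i in s) for s in states]
    averages = [sum(g[i] for i in s) / n for s in states]
    return res(weights, averages)

p = [0.5, 0.5]
res_sharp = res(p, [0.99, 1.01])               # significant: Res much smaller than 1
res_noisy = res(p, [-99.0, 101.0])             # noise: Res much larger than 1
res_avg = res_of_average(p, [-99.0, 101.0], 4) # averaging reduces Res by sqrt(n)

print(res_sharp, res_noisy, res_avg)
```

Under the independence assumption the limit resolution of the average decreases like $1/\sqrt{n}$, which is the quantitative content of the appeal to the law of large numbers above.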

Objective probability. The exposition in Whittle [130] (or, in more abstract terms,
already Gelfand & Naimark [47]) shows that, if $H$ and the $X_j$ are pairwise commuting,
there is a way consistent with the traditional axioms for probability theory (as formalized
by Kolmogorov [71]) to define, for any Gibbs state in our sense, random variables $H$ and
$X_j$ such that the expectation of all sufficiently regular functions $f(H,X)$ defined on the
joint spectrum of $(H,X)$ agrees with the value of $f$. Thus, in the pairwise commuting case,
it is always possible to construct a probability interpretation for the quantities, completely 
independent of any assumed microscopic reality. 

However, if - as in quantum systems - the extensive quantities do not commute, a probabilistic
interpretation in the traditional Kolmogorov sense is no longer possible. Insisting in
the quantum case on a probabilistic interpretation brings with it the well-known difficulties
of quantum philosophy for systems which cannot be repeatedly prepared. It is gratifying
that the present setting avoids these difficulties in an elegant way: There is no need at all to
introduce probabilities into the description; everything applies to the single system under
study.

^ The situation is analogous to the limit resolution with which one can determine the longitude and
latitude of a city such as Vienna. Clearly these are well-defined only up to some limit resolution defined
by the diameter of the city.

Subjective probability. Our formalism is closely related to that used in statistics for
random phenomena expressible in terms of exponential families; cf. Remark 7.2(v) below.
In such a context, a Bayesian, subjective probability interpretation (see, e.g., Barndorff-Nielsen
[12]) of the formalism is possible; then our integral defines the noninformative
prior, i.e., the least informative ensemble. (The noninformative prior is typically improper,
i.e., not a probability distribution, since $\int 1$ is usually not defined.)

While the Bayesian view gives, in my opinion, a misleading picture of statistical mechanics
(see Appendix B), such an interpretation is formally allowed (cf. Appendix A) if - and
only if - (i) correct, complete and exact information about the expectation of all relevant
quantities is assumed to be known, and (ii) the noninformative prior is fixed by the above 
constructions, namely as the correctly weighted Liouville measure in classical physics and as 
the microcanonical ensemble in quantum physics. Only this guarantees that the 'knowledge' 
assumed and hence the results obtained are completely impersonal and objective. However, 
this kind of knowledge is clearly completely hypothetical and has nothing to do with the 
real, subjective knowledge of real observers. Thus, it is better to avoid any reference to 
knowledge; it is neither useful nor necessary but confuses the subject. 

Statistics. In a context where many repeated experiments are feasible, states can be given
a frequentist interpretation, where $\langle g\rangle$ is the expectation of $g$, empirically defined as an
average over many realizations. In this case, $\mathrm{Res}(g)$ becomes the standard deviation of $g$,
divided by the absolute value of the expectation; therefore, it measures the relative accuracy
of the realizations. However, as discussed in detail in the excellent survey by Sklar
[115], this interpretation has significant foundational problems, already in the framework
of classical physics. And it seems especially inappropriate in equilibrium thermodynamics,
where a tiny number of macroscopic observations on a single system completely determine
its state to engineering accuracy.

In the following, we shall, therefore, completely avoid any reference to probability and
statistics. While keeping the terminology of ensembles in the cases where it is established
(the canonical and the grand canonical ensemble), we use the term ensemble in a nonstochastic
manner. We denote by it just a way of defining a state which provides for all
quantities objective values satisfying the above properties. In contrast to tradition, we
take $\langle\cdot\rangle$ not to have an intrinsic probabilistic meaning as an average over a large statistical
ensemble of 'true' microstates.

In this way, we delegate statistics to its role as the art of interpreting measurements, as in
classical physics. Indeed, to have a consistent interpretation, real experiments must be designed
such that they allow one to determine approximately the properties of the state under
study, hence the values of all quantities of interest. The uncertainties in the experiments
imply approximations, which, if treated probabilistically, need an additional probabilistic
layer accounting for measurement errors. Expectations from this secondary layer, which
involve probabilistic statements about situations that are uncertain due to neglected but
observable details (cf. Peres [100]), happen to have the same formal properties as the
values on the primary layer (which contain a complete description of what is observable at
a given energy scale), though their physical origin and meaning is completely different.

In the following, we only consider the primary level, where measurement errors are consid- 
ered to be absent and an objective idealized description is assumed to be valid. 



7 The zeroth law: Thermal states 

We now turn to a restricted but very important class of Gibbs states, those
describing thermal states.

In thermodynamics, one distinguishes between extensive and intensive variables. Extensive 
variables such as mass, charge, or volume depend additively on the size of the system; 
the global quantities are given by fields interpreted as densities integrated over the region 
occupied by the system. The conjugate intensive variables are given by fields interpreted 
as field strengths; they act as parameters defining the state. They cannot be viewed as 
densities; instead, their differences or gradients have physical significance as the sources for 
thermodynamic forces. In particular, in the single-phase global equilibrium case treated in
Section 2, where the densities are constant, the extensive variables are densities multiplied
by the system size, hence scale linearly with the size of the system, while intensive variables
are invariant under a change of system size. (We do not use the alternative convention 
to call extensive any variable which scales linearly with the system size, and intensive any 
variable invariant under a change of system size.) 

Thermal states are good models for macroscopic physical systems that are homogeneous on
the level used for modeling. Indeed, as we shall see, they have all the properties traditionally
postulated in thermodynamics. We distinguish four nested levels of thermal descriptions:
global, local, microlocal and quantum equilibrium. The highest and computationally
simplest level, global equilibrium, is concerned with macroscopic situations described by
finitely many space- and time-independent variables. The next level, local equilibrium,
treats macroscopic situations in a hydrodynamical or continuum mechanical description,
where the relevant variables are finitely many space- and time-dependent fields, though for
stirred chemical reactions, the space-dependence can be ignored. The next deeper level,
microlocal* equilibrium, treats mesoscopic situations in a kinetic description, where the
relevant variables are finitely many fields depending on time, position, and momentum; cf.
Balian [8]. The bottom level is the microscopic regime, where we must consider quantum
equilibrium, which is described in terms of quantum dynamical semigroups. The relations
between the different description levels will be discussed in Section 13. Apart from descriptions
on these clear-cut levels, there are also various hybrid descriptions, where the most
important part of a system is described on a more detailed level than the remaining parts.

* The term microlocal for a phase space dependent analysis is taken from the literature on partial
differential equations; see, e.g., Martinez.

In global equilibrium, all thermal variables are constant throughout the system, except at 
phase boundaries, where the extensive variables may exhibit jumps and only the intensive 
variables remain constant. This is sometimes referred to as the zeroth law of thermody- 
namics and characterizes global equilibrium; it allows one to measure intensive variables 
(like temperature) by bringing a calibrated instrument that is sensitive to this variable (for 
temperature a thermometer) into equilibrium with the system to be measured. For local 
or microlocal equilibrium, the same intuition applies, but with fields in place of variables. 
Then extensive variables are densities represented by distributions that can be meaning- 
fully integrated over bounded regions, whereas intensive variables are nonsingular fields 
(e.g., pressure) whose integrals are physically irrelevant. 

Although dynamics is important for systems not in global equilibrium, we ignore dynamical
issues completely in the remainder of this paper. We take a strictly kinematic point of
view, and look only at a single phase without chemical reactions. In a future paper, we
shall extend the present setting to cover the dynamics of the nonequilibrium case and
deduce quantitatively the dynamical laws (Beris & Edwards, Oettinger) from
microscopic properties, including phase formation, chemical reactions, and the approach to
equilibrium; cf. Balian [8], Grabert [50], Rau & Müller [105], Spohn [117].

In our present setting, the intensive variables are, as in Section 2, numbers characterizing
certain Gibbs states, parameterizing the entropy. To each admissible combination of intensive
variables there is a unique thermal state providing values for all quantities. The
extensive variables then appear as the values of corresponding extensive quantities.

A basic extensive quantity present in each thermal system is the Hamilton energy $H$; in
addition, there are further basic extensive quantities which we call $X_j$ ($j \in J$) and collect
in a vector $X$, indexed by $J$. The number and meaning of the extensive variables depends
on the type of the system; typical examples are given in Table 1 in Section 13.

In the context of statistical mechanics (cf. Examples 4.1), the Euclidean *-algebra $\mathbb{E}$ is
an algebra of functions (for classical physics) or linear operators (for quantum physics),
and $H$ is a particular function or linear operator called the Hamiltonian; specifying the
precise form of the Hamiltonian is essentially equivalent to specifying the system under
consideration. The form of the operators $X_j$ depends on the level of thermal modeling; for
further discussion, see Section 13.

For qualitative theory and for deriving semiempirical recipes, there is no need to know
details about $H$ or $X_j$; it suffices to treat them as primitive objects. The advantage we
gain from our less detailed setting is that a much simpler machinery than that of statistical
mechanics proper suffices to reconstruct all of phenomenological thermodynamics.

It is intuitively clear from the above informal definition of extensive variables that the only 
functions of independent extensive variables that are again extensive can be linear combina- 
tions, and it is a little surprising that the whole machinery of equilibrium thermodynamics 
follows from a formal version of the simple assumption that in thermal states the entropy is 
extensive. We take this to be the mathematical expression of the zeroth law and formalize 
this assumption in a precise mathematical definition. 

For notational simplicity, we consider mainly the case of global equilibrium, where there are 
only finitely many extensive variables. Everything extends, however, with (formally trivial 
but from a rigorous mathematical view nontrivial) changes to local and microlocal equilib- 
rium, where extensive variables are fields, provided the sums are replaced by appropriate 
integrals; cf. Oettinger [97] . 

7.1 Definition. A thermal system is defined by a family of Hermitian extensive variables
$H$ and $X_j$ ($j \in J$) from a Euclidean *-algebra. A thermal state of a thermal system
is a Gibbs state whose entropy is a linear combination of the basic extensive quantities
of the form

$$S = T^{-1}\Big(H - \sum_{j\in J}\alpha_jX_j\Big) = T^{-1}(H - \alpha\cdot X) \quad\text{(zeroth law of thermodynamics)} \tag{54}$$

with suitable real numbers $T \ne 0$ and $\alpha_j$ ($j \in J$). (Here $\alpha$ and $X$ are the vectors with
components $\alpha_j$ ($j \in J$) and $X_j$ ($j \in J$), respectively.) Thus the value of an arbitrary
quantity $g$ is

$$\bar g := \langle g\rangle = \int e^{-\beta(H-\alpha\cdot X)}g, \tag{55}$$

where

$$\beta = \frac{1}{\bar kT}. \tag{56}$$

The numbers $\alpha_j$ are called the intensive variables conjugate to $X_j$, the number $T$ is
called the temperature, and $\beta$ the coldness. $S$, $H$, $X$, $T$, and $\alpha$ are called the thermal
variables of the system. Note that the extensive variables of traditional thermodynamics
are in the present setting not represented by the extensive quantities $S$, $H$, $X_j$ themselves
but by their values $\bar S$, $\bar H$, $\bar X$.

Since we can write the zeroth law (54) in the form

$$H = TS + \alpha\cdot X, \tag{57}$$

called the Euler equation, the temperature $T$ is considered to be the intensive variable
conjugate to the entropy $S$.

As already indicated in Example 2.3, measuring intensive variables is based upon the empirical
fact (which cannot be formulated precisely in our purely kinematic setting) that two
systems in close contact which allows free exchange of some extensive quantity tend to relax
to a joint equilibrium state, in which the corresponding intensive variable is the same in
both systems. If a small measuring device is brought into close contact with a large system,
the joint equilibrium state will be only minimally different from the original state of the
large system; hence the intensive variables of the measuring device will move to the values
of the intensive variables of the large system in the location of the measuring device. This
allows one to read off their value from a calibrated scale.

Many treatises of equilibrium thermodynamics take this possibility of measuring tempera- 
ture to be the contents of the zeroth law of thermodynamics. The present, different choice 
for the zeroth law is superior since it has a definite mathematical content which has far 
reaching consequences. Indeed, as we shall see, our definition already implies the first and 
second law, and (together with a quantization condition) the third law of thermodynamics. 

7.2 Remarks. (i) We emphasize that the extensive quantities $H$ and $X_j$ are independent
of the intensive quantities $T$ and $\alpha$, while $S$, defined by (54), is an extensive quantity
defined only when values for the intensive quantities are prescribed. Note that $T$ and $\alpha$
vary with the state, and the entropy depends on them via (54); hence values also depend
on the particular state a system is in. It is crucial to distinguish between the quantities $H$
or $X_j$ (which are independent of $T$ and $\alpha$ and hence depend on the system but not on the
state) and their values $\bar H = \langle H\rangle$ or $\bar X_j = \langle X_j\rangle$ (which change when the state of the system
changes).

(ii) Although in thermodynamics, the emphasis is on the values of the thermal variables, it
is important to realize that a thermal state gives complete information about the values (55)
of arbitrary quantities, not only the extensive ones. The few numbers or fields specifying $T$
and $\alpha$ fully characterize a thermal state of a given system, while a general non-thermal state
usually has a vastly higher complexity. In particular, global equilibrium is characterized by
a small, finite number of variables, while already local equilibrium involves infinitely many
degrees of freedom.

(iii) Of course, the number of parameters depends on the true physical situation. A system
that is only in local equilibrium cannot be adequately described by the few variables characterizing
global equilibrium. The question of selecting the right set of extensive quantities for an
adequate description is discussed in Section 13.

(iv) An arbitrary linear combination

$$S = \gamma H + h_0X_0 + \ldots + h_sX_s \tag{58}$$

can be written in the form (54) with $T = 1/\gamma$ and $\alpha_j = -h_j/\gamma$, provided that $\gamma \ne 0$;
and indeed, (58) is the mathematically more natural form, which also allows states of
infinite temperature that are excluded in (54). However, the formulation as (54) seems
to be unalterably fixed by tradition, so we shall use it here, too. This shows that the
coldness $\beta$ is a more natural variable than the temperature $T$; it figures prominently in
statistical mechanics. In the limit $T \to 0$, a system becomes infinitely cold, giving intuition
for the unattainability of zero absolute temperature. States of negative temperature, i.e.,
negative coldness, must therefore be considered to be hotter, i.e., less cold, than states
of any positive temperature, cf. Landau & Lifschitz [78]. To model thermal behavior
at negative or infinite temperature, one can alternatively introduce a dummy temperature
$T = 1$ and a dummy Hamiltonian $H = 0$, and treat the true Hamiltonian and the coldness
as additional components of $X$ and $\alpha$, respectively.

(v) In mathematical statistics, there is a large body of work on exponential families, which
is essentially the mathematical equivalent of the concept of a thermal state over a commutative
algebra; see, e.g., Barndorff-Nielsen. In this context, the values of the
extensive quantities define a sufficient statistic, from which the whole distribution can be
reconstructed (cf. Theorem 8.4 below and the remarks on objective probability in Section
6). This is one of the reasons why exponential families provide a powerful machinery for
statistical inference; see, e.g., Bernardo & Smith [17]. For recent extensions to quantum
statistical inference, see, e.g., Barndorff-Nielsen et al. and the references there.



8 The equation of state 

Not every combination $(T,\alpha)$ of intensive variables defines a thermal state; the requirement
that $\langle 1\rangle = 1$ enforces a restriction of $(T,\alpha)$ to a manifold of admissible thermal states.

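8.1 Theorem. Suppose that $T > 0$.

(i) For any $\kappa > 0$, the system function $\Delta$ defined by

$$\Delta(T,\alpha) := \kappa T\log\int e^{-\beta(H-\alpha\cdot X)} \tag{59}$$

is a convex function of $T$ and $\alpha$. It vanishes only if $T$ and $\alpha$ are the intensive variables of
a thermal state.

(ii) In a thermal state, the intensive variables are related by the equation of state

$$\Delta(T,\alpha) = 0. \tag{60}$$

The state space is the set of $(T,\alpha)$ satisfying (60).

(iii) The values of the extensive variables are given by

$$\bar S = \Omega\frac{\partial\Delta}{\partial T}(T,\alpha), \quad \bar X = \Omega\frac{\partial\Delta}{\partial\alpha}(T,\alpha) \quad\text{for some } \Omega > 0, \tag{61}$$

and the phenomenological Euler equation

$$\bar H = T\bar S + \alpha\cdot\bar X. \tag{62}$$

(iv) Regarding $\bar S$ and $\bar X$ as functions of $T$ and $\alpha$, the matrix

$$\Sigma := \begin{pmatrix} \dfrac{\partial\bar S}{\partial T} & \dfrac{\partial\bar S}{\partial\alpha} \\ \dfrac{\partial\bar X}{\partial T} & \dfrac{\partial\bar X}{\partial\alpha} \end{pmatrix} \tag{63}$$

is symmetric and positive semidefinite; in particular, we have the Maxwell reciprocity
relations

$$\frac{\partial\bar X_i}{\partial\alpha_j} = \frac{\partial\bar X_j}{\partial\alpha_i}, \quad \frac{\partial\bar X_i}{\partial T} = \frac{\partial\bar S}{\partial\alpha_i} \tag{64}$$

and the stability conditions

$$\frac{\partial\bar S}{\partial T} \ge 0, \quad \frac{\partial\bar X_j}{\partial\alpha_j} \ge 0 \quad (j \in J). \tag{65}$$
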



Proof. By Theorem 5.2(i), the function $\phi$ defined by

$$\phi(\alpha_0,\alpha) := \log\int e^{-(\alpha_0H-\alpha\cdot X)} = -W(\alpha_0H - \alpha\cdot X)$$

is a convex function of $\alpha_0$ and $\alpha$. Put $\Omega = \bar k/\kappa$. Then, by Proposition C.4,

$$\Delta(T,\alpha) = -\kappa T\,W(\beta(H-\alpha\cdot X)) = \kappa T\,\phi\Big(\frac{1}{\bar kT},\frac{\alpha}{\bar kT}\Big) \tag{66}$$

is also convex. The condition $\Delta(T,\alpha) = 0$ is equivalent to $\int e^{-\beta(H-\alpha\cdot X)} = 1$,
the condition for a thermal state. This proves (i) and (ii).

(iii) The formulas for $\bar S$ and $\bar X$ follow by differentiation of (66) with respect to $T$ and $\alpha$,
using (44). Equation (62) follows by taking values in (57), noting that $T$ and $\alpha$ are real
numbers.

(iv) By (iii), the matrix

$$\Sigma = \Omega\begin{pmatrix} \dfrac{\partial^2\Delta}{\partial T^2} & \dfrac{\partial^2\Delta}{\partial T\partial\alpha} \\ \dfrac{\partial^2\Delta}{\partial\alpha\partial T} & \dfrac{\partial^2\Delta}{\partial\alpha^2} \end{pmatrix}$$

is a positive multiple of the Hessian matrix of the convex function $\Delta$. Hence $\Sigma$ is symmetric and positive
semidefinite. (64) expresses the symmetry of $\Sigma$, and (65) holds since the diagonal entries of a positive
semidefinite matrix are nonnegative. □

8.2 Remarks. (i) For $T < 0$, the same results hold, with the change that $\Delta$ is concave
instead of convex, $\Sigma$ is negative semidefinite, and the inequality signs in (65) are reversed.
This is a rare situation; it can occur only in (nearly) massless systems embedded within
(much heavier) matter, such as spin systems (cf. Purcell & Pound [104]) or vortices in
2-dimensional fluids (cf. Montgomery & Joyce [87], Eyinck & Spohn). A massive
thermal system couples significantly to kinetic energy. In this case, the total momentum
$p$ is an extensive quantity, related to the velocity $v$, the corresponding intensive variable,
by $\bar p = Mv$, where $M$ is the extensive total mass of the system. From (61), we find that
$\bar p = \Omega\,\partial\Delta/\partial v$, which implies that $\Delta = \Delta|_{v=0} + \frac{M}{2\Omega}v^2$. Since the mass is positive, this
expression is convex in $v$, not concave; hence $T > 0$. Thus, in a massive thermal system,
the temperature must be positive.

(ii) In the application, the free scaling constant $\kappa$ is usually chosen as $\kappa = \bar k/\Omega$, where $\Omega$
is a measure of system size, e.g., the total volume or total mass of the system. In actual
calculations from statistical mechanics, the integral is usually a function of the system
size. To make the result independent of it, one performs the so-called thermodynamic limit
$\Omega \to \infty$; thus $\Omega$ must be chosen in such a way that this limit is nontrivial. Extensivity in
single phase global equilibrium then justifies treating $\Omega$ as an arbitrary positive factor.

The equation of state shows that, apart from possible singularities describing so-called
phase transitions, the state space has the structure of an $(s-1)$-dimensional manifold in
$\mathbb{R}^s$, where $s$ is the number of intensive variables.

In phenomenological thermodynamics (cf. Section 2), one makes suitable, more or less
heuristic assumptions on the form of the system function, while in statistical mechanics,
one derives its form from (59) and specific choices for the quantities $H$ and $X$ within
one of the settings described in Example 4.1. Given these choices, the main task is then
the evaluation of the system function (59) since everything else can be computed from it.
Frequently, (59) can be approximately evaluated from the cumulant expansion (42) and/or
a mean field approximation (51).
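For a model small enough to be summed explicitly, the system function can be evaluated directly and the statements of Theorem 8.1 checked numerically. The sketch below is an illustration under assumptions not made in the text: a classical system with three states, the integral taken as the sum over states, a single extensive quantity $X$ besides $H$, and model units with Boltzmann constant 1. The values in `ENERGY` and `EXTENT` and the choice of $\kappa$ are hypothetical.

```python
import math

K_BAR = 1.0                  # Boltzmann constant in model units (assumption)
KAPPA = 0.5                  # free scaling constant kappa > 0 (arbitrary choice)
OMEGA = K_BAR / KAPPA        # Omega = k/kappa as in the proof of Theorem 8.1

ENERGY = [0.0, 1.0, 2.5]     # values of H in the three states (hypothetical)
EXTENT = [1.0, 2.0, 3.0]     # values of the extensive quantity X (hypothetical)

def weights(T, alpha):
    beta = 1.0 / (K_BAR * T)
    return [math.exp(-beta * (e - alpha * x)) for e, x in zip(ENERGY, EXTENT)]

def delta(T, alpha):
    """System function (59) for the toy model."""
    return KAPPA * T * math.log(sum(weights(T, alpha)))

def alpha_on_state_space(T, lo=-10.0, hi=10.0):
    """Solve delta(T, alpha) = 0 by bisection; delta increases with alpha since X > 0."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if delta(T, mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

T = 1.0
alpha = alpha_on_state_space(T)          # equation of state (60)
p = weights(T, alpha)                    # thermal state; the weights sum to 1
H_bar = sum(pi * e for pi, e in zip(p, ENERGY))
X_bar = sum(pi * x for pi, x in zip(p, EXTENT))
S_bar = sum(pi * (e - alpha * x) / T for pi, e, x in zip(p, ENERGY, EXTENT))

step = 1e-5                              # central differences approximating (61)
S_from_delta = OMEGA * (delta(T + step, alpha) - delta(T - step, alpha)) / (2 * step)
X_from_delta = OMEGA * (delta(T, alpha + step) - delta(T, alpha - step)) / (2 * step)

# midpoint convexity of delta between two arbitrary points with T > 0
convexity_gap = 0.5 * (delta(1.0, 0.3) + delta(2.0, -1.0)) - delta(1.5, -0.35)

print(alpha, S_bar, S_from_delta, X_bar, X_from_delta, convexity_gap)
```

The values obtained by differentiating the system function should agree with the directly computed values up to the finite-difference error, and the midpoint test should give a nonnegative gap. Bisection is used only because the toy model has a single intensive variable besides $T$; with several $\alpha_j$ the state space is a surface rather than a curve.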

An arbitrary Gibbs state is generally not a thermal state. However, we can try to approxi- 
mate it by an equilibrium state. 

8.3 Theorem. Let $\langle\cdot\rangle$ be a Gibbs state with entropy $S$. Then, for arbitrary $(T,\alpha)$ satisfying
$T > 0$ and the equation of state (60), the values $\bar H = \langle H\rangle$, $\bar S = \langle S\rangle$, and $\bar X = \langle X\rangle$
satisfy

$$\bar H \ge T\bar S + \alpha\cdot\bar X. \tag{67}$$

Equality only holds if $S$ is the entropy of a thermal state with intensive variables $(T,\alpha)$.

Proof. The equation of state implies that $S_c := T^{-1}(H - \alpha\cdot X)$ is the entropy of a thermal
state. Now the assertion follows from Theorem 5.3, since $\langle S\rangle \le \langle S_c\rangle = T^{-1}(\langle H\rangle - \alpha\cdot\langle X\rangle)$,
with equality only if $S = S_c$. □
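The inequality $\langle S\rangle \le T^{-1}(\langle H\rangle - \alpha\cdot\langle X\rangle)$ used in the proof can be tested on a toy model. The sketch assumes a classical system with three states, the integral taken as the sum over states, Boltzmann constant 1, and a single additional extensive quantity $X = 1$, so that the equation of state can be solved in closed form as $\alpha = -\bar kT\log\sum_i e^{-E_i/(\bar kT)}$. These choices, the energies, and the trial states are hypothetical. With them, the statement becomes the familiar bound of the free energy by $\bar H - T\bar S$.

```python
import math

K_BAR = 1.0                 # Boltzmann constant in model units (assumption)
ENERGY = [0.0, 1.0, 2.5]    # values of H in the three states (hypothetical)

def thermal_state(T):
    """Weights exp(-E_i/(k T)) / Z of the thermal state at temperature T."""
    w = [math.exp(-e / (K_BAR * T)) for e in ENERGY]
    z = sum(w)
    return [wi / z for wi in w]

def alpha_on_state_space(T):
    """Solution of the equation of state when the only extra quantity is X = 1."""
    return -K_BAR * T * math.log(sum(math.exp(-e / (K_BAR * T)) for e in ENERGY))

def slack(p, T):
    """H_bar - T*S_bar - alpha*X_bar for the Gibbs state with weights p and X = 1."""
    h_bar = sum(pi * e for pi, e in zip(p, ENERGY))
    s_bar = -K_BAR * sum(pi * math.log(pi) for pi in p)   # entropy S_i = -k log p_i
    return h_bar - T * s_bar - alpha_on_state_space(T)

T = 1.0
slack_thermal = slack(thermal_state(T), T)       # equality case
slack_uniform = slack([1 / 3, 1 / 3, 1 / 3], T)  # a non-thermal Gibbs state
slack_hotter = slack(thermal_state(2.0), T)      # thermal, but at another temperature

print(slack_thermal, slack_uniform, slack_hotter)
```

The slack should vanish (up to rounding) only for the thermal state belonging to the chosen $(T,\alpha)$ and be positive for the other two trial states.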
As the theorem shows, everything of macroscopic interest is deducible from an explicit
formula for the system function. This explains why, in many situations, one can use thermodynamics
very successfully as a phenomenological theory without having to bother about
microscopic details. It suffices that a phenomenological expression for $\Delta(T,\alpha)$ is available.
In particular, the phenomenological axioms from Section 2 now follow by specializing the
above to a standard system, characterized by the extensive quantities

H,Xo = V, X, = iV, (jVO), (68) 

where, as before, V denotes the (positive) volume of the system, and each Nj denotes 
the (nonnegative) number of molecules of a fixed chemical composition (we shall call these 
particles of species j). However, H and the Nj are now quantities from E, rather than 
thermal variables. 
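The inequality of Theorem 8.3 can be tried out numerically. The following is only a minimal sketch under assumptions that are not part of the text: the algebra is taken to be a finite classical one (functions on finitely many levels, with the integral being the plain sum over levels), the only extensive quantity besides $H$ is the constant $X = 1$, and units are chosen so that $k = 1$. All function names are illustrative.

```python
import math

K_B = 1.0  # assumption: units in which the Boltzmann constant is 1


def thermal_alpha(energies, T):
    """Solve the equation of state sum_n exp(-(E_n - a)/kT) = 1 for a (with X = 1)."""
    z = sum(math.exp(-e / (K_B * T)) for e in energies)
    return -K_B * T * math.log(z)


def thermal_probs(energies, T):
    """Densities exp(-S_n/k) of the thermal state with S = (H - a X)/T."""
    a = thermal_alpha(energies, T)
    return [math.exp(-(e - a) / (K_B * T)) for e in energies]


def gap(energies, probs, T):
    """Return H_bar - (T*S_bar + a*X_bar) for the Gibbs state with density probs.

    The entropy of the Gibbs state has levels S_n = -k log(probs[n]).
    """
    a = thermal_alpha(energies, T)
    h_bar = sum(p * e for p, e in zip(probs, energies))
    s_bar = -K_B * sum(p * math.log(p) for p in probs)
    x_bar = 1.0
    return h_bar - (T * s_bar + a * x_bar)
```

In this toy setting the gap equals $kT$ times the relative entropy between the chosen Gibbs state and the thermal state, so it is nonnegative and vanishes only for the thermal state itself.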

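$$P := -\alpha_0 \qquad (69)$$

is called pressure, and

$$\mu_j := \alpha_j \ \ (j\ne 0) \qquad (70)$$

the chemical potential of species $j$; hence

$$\alpha\cdot X = -PV + \mu\cdot N.$$

Specializing the theorem, we find the phenomenological Euler equation

$$\bar H = T\bar S - P\bar V + \mu\cdot\bar N. \qquad (71)$$

For reversible changes, we have the first law of thermodynamics

$$d\bar H = T\,d\bar S - P\,d\bar V + \mu\cdot d\bar N \qquad (72)$$

and the Gibbs-Duhem equation

$$0 = \bar S\,dT - \bar V\,dP + \bar N\cdot d\mu. \qquad (73)$$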

A comparison with Section 2 shows that dropping the bars from the values reproduces for
$T > 0$, $P > 0$ and $S \ge 0$ the axioms of phenomenological thermodynamics, except for
the extensivity outside equilibrium (which has local equilibrium as its justification). The
assumption $T > 0$ was justified in Remark [HIW i), and $S \ge 0$ will be justified in Section 12.
But I have been unable to find a theoretical argument which shows that the pressure of a
standard system in the above sense must always be positive. (At $T < 0$, negative pressure is
possible; see Example 8.5.) I would appreciate getting information about this from my readers.

Apart from boundary effects, which become more and more unimportant as the system
gets larger, the extensive quantities scale linearly with the volume. In the thermodynamic
limit, corresponding to an idealized system infinitely extended in all directions, this
becomes exact, although this can be proved rigorously only in simple situations, e.g., for hard
sphere model systems (Yang & Lee [132]) or spin systems (Griffiths [53]). A thorough
treatment of the thermodynamic limit (e.g., Thirring [122], or, in the framework of large
deviation theory, Ellis) in general needs considerably more algebraic and analytic
machinery; e.g., one has to work, in place of thermal states, with more abstract KMS-states
(which are limits of sequences of thermal states still satisfying a KMS condition (29)).
Moreover, proving the existence of the limit requires detailed properties of the concrete
microscopic description of the system.

For very small systems, typically atomic clusters or molecules, $N$ is fixed and a canonical
ensemble without the $\mu\cdot N$ term is more appropriate. For the thermodynamics of small
systems (see, e.g., Bustamente et al. [25], Gross [54], Kratky [72]) such as a single
cluster of atoms, $V$ is still taken as a fixed reference volume, but now changes in the physical
volume (adsorption or dissociation at the surface) are not represented in the system, hence
need not respect the thermodynamic laws. For large surfaces (e.g., adsorption studies
in chromatography [69, 85]), a thermal description is achievable by including additional
variables (surface area and surface tension) to account for the boundary effects; but clearly,
surface terms scale differently with system size than bulk terms.

Thus, whenever the thermal description is valid, computations can be done in a fixed
reference volume $V_0$ which we take as system size $\Omega$, and the true, variable volume $V$ can
always be represented in the Euclidean *-algebra as a real number, so that in particular
$\bar V = V$. Then (59) implies that, for the reference volume,

$$\Delta(T,\alpha) = \Omega^{-1}kT\log\Big(e^{-\beta P\Omega}\int e^{-\beta(H-\mu\cdot N)}\Big),$$

hence

$$\Delta(T,\alpha) = \Omega^{-1}kT\big(\log Z(T,\mu) - \beta P\Omega\big) = P(T,\mu) - P, \qquad (74)$$

where

$$Z(T,\mu) := \int e^{-\beta(H-\mu\cdot N)} \qquad (75)$$
is the so-called grand canonical partition function of the system and

$$P(T,\mu) := \Omega^{-1}kT\log Z(T,\mu). \qquad (76)$$

With our convention of considering a fixed reference volume and treating the true volume
$V$ as a scale factor (otherwise a thermodynamic limit would be needed), this expression is
independent of $V$, since it relates intensive variables unaffected by scaling. The equation
of state (60) therefore takes the form

$$P = P(T,\mu). \qquad (77)$$

Quantitative expressions for the equation of state can often be computed from (75)-(76)
using the cumulant expansion (42) and/or a mean field approximation (51). Note that
these relations imply

$$e^{\beta P(T,\mu)\Omega} = \int e^{-\beta(H-\mu\cdot N)}.$$

Traditionally (see, e.g., Gibbs [H], Huang [61], Reichl [106]), the thermal state
corresponding to (74)-(76) is called a grand canonical ensemble, and the following results
are taken as the basis for microscopic calculations from statistical mechanics.

8.4 Theorem. For a standard system in global equilibrium, values of an arbitrary quantity
$g$ can be calculated from (75) and

$$\langle g\rangle = Z(T,\mu)^{-1}\int e^{-\beta(H-\mu\cdot N)}\,g. \qquad (78)$$

The values of the extensive quantities are given in terms of the equation of state (76) by

$$\bar S = V\frac{\partial P}{\partial T}(T,\mu),\qquad \bar N_j = V\frac{\partial P}{\partial\mu_j}(T,\mu) \qquad (79)$$

and the phenomenological Euler equation (71).

Proof. Equation (76) implies that

$$\langle g\rangle = \int e^{-S/k}g = \int e^{-\beta(H+PV-\mu\cdot N)}g = e^{-\beta PV}\int e^{-\beta(H-\mu\cdot N)}g = Z(T,\mu)^{-1}\int e^{-\beta(H-\mu\cdot N)}g,$$

giving (78). The formulas in (79) follow from (61) and (74). □



No thermodynamic limit was needed to derive the above results. Thus, everything holds
- though with large limit resolutions in measurements - even for single small systems
(Bustamente et al. [25], Gross [54], Kratky [72]).
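The recipe of Theorem 8.4 can be exercised on a very small system. The sketch below is an illustration under assumptions not taken from the text: the system has finitely many energy levels and no particle-number term, the integral is the plain sum over the levels, the system size is $\Omega = 1$ by default, and $k = 1$. Function names are hypothetical.

```python
import math

K_B = 1.0  # assumption: units in which the Boltzmann constant is 1


def partition_function(levels, T):
    """Z(T) = sum over levels of exp(-E_n / kT), cf. (75) without the mu.N term."""
    return sum(math.exp(-e / (K_B * T)) for e in levels)


def pressure(levels, T, omega=1.0):
    """P(T) = kT log Z(T) / Omega, cf. (76)."""
    return K_B * T * math.log(partition_function(levels, T)) / omega


def mean_energy(levels, T):
    """Value of H computed as in (78): weighted average with weights exp(-E_n/kT)/Z."""
    z = partition_function(levels, T)
    return sum(e * math.exp(-e / (K_B * T)) for e in levels) / z


def heat_capacity(levels, T, h=1e-5):
    """C = d(mean energy)/dT, approximated by a central finite difference."""
    return (mean_energy(levels, T + h) - mean_energy(levels, T - h)) / (2.0 * h)
```

For two levels `[0.0, E]` these functions can be compared with the closed forms $\bar H = E/(e^{E/kT}+1)$ and $P = kT\log(e^{E/kT}+1) - E$; calling them with a negative `T` gives a mean energy above $E/2$ together with a negative pressure.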



8.5 Example. We consider the two level system from Example 4.4, using $\Omega = 1$ as system
size. From (75) and (76), we find $Z(T,\mu) = 1 + e^{-E/kT}$, hence

$$P(T,\mu) = kT\log(1+e^{-E/kT}) = kT\log(e^{E/kT}+1) - E.$$

From (78), we find

$$\bar H = \frac{E\,e^{-E/kT}}{1+e^{-E/kT}} = \frac{E}{e^{E/kT}+1},\qquad T = \frac{E}{k\log(E/\bar H-1)}.$$

(This implies that a two-level system has negative temperature and negative pressure if
$\bar H > E/2$.) The heat capacity $C := d\bar H/dT$ takes the form

$$C = \frac{E^2}{kT^2}\,\frac{e^{E/kT}}{(e^{E/kT}+1)^2}.$$

It exhibits a pronounced maximum, the so-called Schottky bump (cf. Callen [26]),
from which $E$ can be determined. In view of ( llOOp below, this allows the experimental
estimation of the spectral gap of a quantum system. The phenomenon persists to some
extent for multilevel systems; see Civitarese et al. [32].

9 Description levels



So far, we have assumed a fixed selection of extensive quantities defining the thermal model.
However, as indicated at the end of Section 2, observable differences from the conclusions
derived from a thermal model imply that one or more conjugate pairs of thermal variables
are missing in the model. In this section, we discuss in more detail the relation between
different thermal models constructed on the basis of the same Euclidean *-algebra by selecting
different lists of extensive quantities.

Our first observation is the flexibility of the thermal setting. While the zeroth law may 
look very restrictive at first sight, by choosing a large enough family of extensive quantities 
the entropy of an arbitrary Gibbs state can be approximated arbitrarily well by a linear 
combination of these quantities. 

The zeroth law thus simply appears as an embodiment of Ockham's razor^ [96], freely
paraphrased in modern form: that we should opt for the most economic model explaining 
a phenomenon - by restricting attention to the relevant extensive quantities only. At each 
time t, there is - except in degenerate cases - a single Gibbs state, with entropy S{t), 
say, which best describes the system under consideration. Assuming the description by the 
Gibbs state as fundamental, its value is the objective, true value of the entropy, relative 
only to the algebra of quantities chosen to model the system. A description of the state in 
terms of a thermal system is therefore adequate if (and, under an observability qualification 
to be discussed below, only if), for all relevant times t, the entropy S(t) can be adequately 
approximated by a linear combination of the extensive quantities available at the chosen 
level of description. 

The set of extensive variables depends on the application and on the desired accuracy of 
the model; it must be chosen in such a way that knowing the measured values of the exten- 
sive variables determines (to the accuracy specified) the complete behavior of the thermal 
system. Thus, the choice of extensive variables is (to the accuracy specified) completely de- 
termined by the level of accuracy with which the thermal description should fit the system's 
behavior. This forces everything else: The theory must describe the freedom available to 
characterize a particular thermal system with this set of extensive variables, and it must 
describe how the numerical values of interest can be computed for each state of each thermal 
system. 

In contrast to the information theoretic approach, where the choice of extensive quantities
is considered to be the subjective matter of which observables an observer happens to have
knowledge of, the only subjective aspect of our setting is the choice of the resolution of
modeling. This fixes the amount of approximation tolerable in the ansatz, and hence the
necessary list of extensive quantities. (Clearly, physics cannot be done without
approximation, and the choice of a resolution is unavoidable. To remove even this trace of
subjectivity, inherent in any approximation of anything, the entropy would have to be
represented without any approximation, which would require using as the algebra of quantities
the still unknown theory of everything, and demanding that the extensive quantities exhaust
this algebra.)

^ frustra fit per plura quod potest fieri per pauciora

Table 1: Typical conjugate pairs of thermal variables and their contribution to the Euler
equation. The signs are fixed by tradition. (In the gravitational term, $m$ is the vector with
components $m_j$, the mass of a particle of species $j$, $g$ the acceleration of gravity, and $h$ the
height.)

extensive $X_j$              intensive $\alpha_j$                 contribution $\alpha_j X_j$
------------------------------------------------------------------------------------------------
entropy $S$                  temperature $T$                      thermal, $TS$
particle number $N_j$        chemical potential $\mu_j$           chemical, $\mu_j N_j$
conformation tensor $C$      relaxation force $R$                 conformational, $\sum R_{jk}C^{jk}$
strain $\varepsilon^{jk}$    stress $\sigma_{jk}$                 elastic, $\sum \sigma_{jk}\varepsilon^{jk}$
volume $V$                   pressure $-P$                        mechanical, $-PV$
surface $A_S$                surface tension $\gamma$             mechanical, $\gamma A_S$
length $L$                   tension $J$                          mechanical, $JL$
displacement $q$             force $-F$                           mechanical, $-F\cdot q$
momentum $p$                 velocity $v$                         kinetic, $v\cdot p$
angular momentum $J$         angular velocity $\Omega$            rotational, $\Omega\cdot J$
charge $Q$                   electric potential $\Phi$            electrical, $\Phi Q$
polarization $P$             electric field strength $E$          electrical, $E\cdot P$
magnetization $M$            magnetic field strength $B$          magnetic, $B\cdot M$
e/m field $F$                e/m field strength $-F$              electromagnetic, $-\sum F_{\mu\nu}F^{\mu\nu}$
mass $M = m\cdot N$          gravitational potential $gh$         gravitational, $ghM$
energy-momentum $U$          metric $g$                           gravitational, $\sum g_{\mu\nu}U^{\mu\nu}$


In general, which quantities need to be considered depends on the resolution with which 
the system is to be modeled - the higher the resolution the larger the family of extensive 
quantities. Thus - whether we describe bulk matter, surface effects, impurities, fatigue, 
decay, chemical reactions, or transition states, - the thermal setting remains the same since 
it is a universal approximation scheme, while the number of degrees of freedom increases 
with increasingly detailed models. 

In phenomenological thermodynamics, the relevant extensive quantities are precisely those
variables that are observed to make a difference in modeling the phenomenon of interest.
Table 1 gives typical extensive variables, their intensive conjugate variables, and their
contribution to the Euler equation^. Some of the extensive variables and their intensive
conjugates are vectors or (in elasticity theory, the theory of complex fluids, and in the
relativistic case) tensors; cf. Balian [7] for the electromagnetic field and Beris & Edwards
[TBi|, Öttinger [97] for complex fluids.

^ The Euler equation, which contains the energy contributions specified in the table, looks like an energy
balance. But since $S$ is undefined, this formal balance has no contents apart from defining the entropy $S$ in
terms of the energy and other contributions. The energy balance is rather given by the first law discussed
later, and is about changes in energy. Conservative work contributions are exact differentials. For example,
the mechanical force $F = -dV(q)/dq$ translates into the term $-F\cdot dq = dV(q)$ of the first law, corresponding
to the term $-F\cdot q$ in the Euler equation. The change of the kinetic energy $E_{\rm kin} = mv^2/2$ contribution of
linear motion with velocity $v$ and momentum $p = mv$ is $dE_{\rm kin} = d(mv^2/2) = mv\cdot dv = v\cdot dp$, which is
exactly what one gets from the $v\cdot p$ contribution in the Euler equation. Since $v\cdot p = mv^2$ is larger than the
kinetic energy, this shows that motion implies a contribution to the entropy of $(E_{\rm kin} - v\cdot p)/T = -mv^2/2T$.
A similar argument applies to the angular motion of a rigid body in its rest frame, providing the term
involving angular velocity and angular momentum.

To analyze the relation between two different thermal description levels, we compare a coarse
system and a more detailed system quantitatively, taking for simplicity the temperature
constant, so that the $T$-dependence can be suppressed in the formulas, and states are
completely determined by $\alpha$.

The fine system will be written as before; the variables and quantities associated with the
coarser system get an additional index $c$. In order to be able to compare the two systems,
we assume that one is a refinement of the other, so that the extensive quantities of the
coarse system are $X_c = CX$, with a fixed matrix $C$ with linearly independent rows, whose
components tell how the components of $X_c$ are built from those of $X$. The entropy of the
coarse system is then given by

$$S_c = T^{-1}(H - \alpha_c\cdot X_c) = T^{-1}(H - \alpha_c\cdot CX) = T^{-1}(H - \alpha^*\cdot X),$$

where

$$\alpha^* = C^T\alpha_c.$$

Thus, the thermal states of the coarse model are simply the states of the detailed model
for which the intensive parameter vector $\alpha$ is of the form $\alpha = C^T\alpha_c$ for some $\alpha_c$. Thus the
coarse state space can simply be viewed as a lower-dimensional subspace of the detailed
state space. Therefore one expects the coarse description to be adequate precisely when
the detailed state is close to the coarse state space, with an accuracy determined by the
desired fidelity of the coarse model. Since the relative entropy (50),

$$\langle S_c - S\rangle = \langle T^{-1}(H - \alpha_c\cdot CX) - T^{-1}(H - \alpha\cdot X)\rangle = \langle T^{-1}(\alpha - C^T\alpha_c)\cdot X\rangle, \qquad (81)$$

measures the amount of information in the detailed state which cannot be explained by the
coarse state, it is sensible to associate to an arbitrary detailed state $\alpha$ the coarse state $\alpha_c$
determined by minimizing (81). If $\alpha^* = C^T\alpha_c \approx \alpha$ then

$$S_c = T^{-1}(H - \alpha^*\cdot X) \approx T^{-1}(H - \alpha\cdot X) = S,$$

and the coarse description is adequate. If $\alpha^* \not\approx \alpha$, there is no a priori reason to trust the
coarse model, and we have to investigate to which extent its predictions will significantly
differ from those of the detailed model. One expects the differences to be significant;
however, in practice, there are difficulties if there are limits on our ability to prepare particular
detailed states. The reason is that the entropy and chemical potentials can be prepared and
measured only by comparison with sufficiently well-known states. A first sign of this
is the gauge freedom in ideal gases discussed in Example 2.3, which implies that different
models of the same situation may have nontrivial differences in Hamilton energy, entropy,
and chemical potential. This ambiguity persists in more perplexing situations:

9.1 Example. (The Gibbs paradox) 

Suppose that we have an ideal gas of two species $j = 1, 2$ of particles which are
experimentally indistinguishable. Suppose that in the samples available for experiments, the two
species are mixed in significantly varying proportions $N_1 : N_2 = q_1 : q_2$ which, by
assumption, have no effect on the observable properties; in particular, their values are unknown
but varying. The detailed model treats them as distinct, the coarse model as identical.
Reverting to the barless notation of Section 2, we have

$$X = \begin{pmatrix} V \\ N_1 \\ N_2 \end{pmatrix},\quad \alpha = \begin{pmatrix} -P \\ \mu_1 \\ \mu_2 \end{pmatrix},\quad X_c = \begin{pmatrix} V \\ N_c \end{pmatrix},\quad \alpha_c = \begin{pmatrix} -P \\ \mu_c \end{pmatrix},$$

and, assuming $C = \begin{pmatrix} 1 & 0 & 0 \\ 0 & c_1 & c_2 \end{pmatrix}$ for suitable $c_1, c_2 > 0$,

$$N_c = c_1N_1 + c_2N_2.$$

From the known proportions, we find

$$N_j = x_jN_c,\qquad x_j = \frac{q_j}{c_1q_1 + c_2q_2}.$$

The mixture behaves like an ideal gas of a single species, hence

$$PV = kTN_c,\qquad H = h_c(T)N_c,\qquad \mu_c = kT\log\frac{kTN_c}{V\pi_c(T)}.$$

Now $N_c = (kT)^{-1}PV = \sum N_j = \sum x_jN_c$ implies that $x_1 + x_2 = 1$. Because of
indistinguishability, this must hold for any choice of $q_1, q_2 \ge 0$; for the two choices $q_1 = 0$ and
$q_2 = 0$, we get $c_1 = c_2 = 1$, hence $N_c = \sum N_j$, and the $x_j$ are mole fractions. Similarly,
if we use for all species $j$ the same normalization for fixing the gauge freedom, the
relation $h_c(T)N_c = H = \sum h_j(T)N_j = \sum h_j(T)x_jN_c$ implies for varying mole fractions that
$h_j(T) = h_c(T)$ for $j = 1, 2$. From this, we get $\pi_j(T) = \pi_c(T)$ for $j = 1, 2$. Thus

$$H - H_c = \sum h_j(T)N_j - h_c(T)N_c = 0,$$

$$\mu_j - \mu_c = kT\log\frac{kTN_j}{V\pi_j(T)} - kT\log\frac{kTN_c}{V\pi_c(T)} = kT\log x_j,$$

the Gibbs energy satisfies

$$G - G_c = \sum \mu_jN_j - \mu_cN_c = \sum(\mu_j - \mu_c)N_j = kTN_c\sum x_j\log x_j,$$

and the entropy satisfies

$$S - S_c = T^{-1}(H + PV - G) - T^{-1}(H_c + PV - G_c) = -T^{-1}(G - G_c) = -kN_c\sum x_j\log x_j.$$

This term is called the entropy of mixing. Its occurrence is referred to as Gibbs paradox
(cf. Jaynes; Tseng & Caticha [125]; Allahverdyan & Nieuwenhuizen [4];
Uffink [126, Section 5.2]). It seems to say that there are two different entropies, depending
on how we choose to model the situation. For fixed mole fractions, the paradox can be
resolved upon noticing that the fine and the coarse description differ only by a choice of
gauge; the gauge is unobservable anyway, and the entropy is determined only when the
gauge is fixed.
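The entropy of mixing in the example can be reproduced numerically from the ideal gas formula for the chemical potential. This is a sketch only: it assumes the form $\mu = kT\log(kTN/(V\pi(T)))$ used in the example with a common value `pi_T` for both species, sets $k = 1$, and uses hypothetical function names.

```python
import math

K_B = 1.0  # assumption: units in which the Boltzmann constant is 1


def mu_ideal(T, N, V, pi_T):
    """Chemical potential kT log(kT N / (V pi(T))) of an ideal gas species."""
    return K_B * T * math.log(K_B * T * N / (V * pi_T))


def entropy_difference(T, V, N1, N2, pi_T):
    """S - S_c = -(G - G_c)/T for the detailed versus the coarse ideal gas model."""
    n_c = N1 + N2
    g_fine = sum(mu_ideal(T, n, V, pi_T) * n for n in (N1, N2))
    g_coarse = mu_ideal(T, n_c, V, pi_T) * n_c
    return -(g_fine - g_coarse) / T


def entropy_of_mixing(n_c, fractions):
    """-k N_c sum_j x_j log x_j for mole fractions x_j (terms with x_j = 0 vanish)."""
    return -K_B * n_c * sum(x * math.log(x) for x in fractions if x > 0)
```

Both routes give the same number, and the result does not depend on `T`, `V`, or `pi_T`, in line with the remark that the two descriptions differ only in how the composition is accounted for.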

However, if the mole fractions vary, the fine and the coarse description differ significantly. If 
the detailed model is correct, the coarse model gives a wrong description for the entropy and 
the chemical potentials. However, this difference is observable only if we know processes 
which affect the different species differently, such as a difference in mass, which allows 
a mechanical separation, in molecular size or shape, which allows their separation by a 
semipermeable membrane, in spin, which allows a magnetic separation, or in scattering 
properties of the particles, which may allow a chemical or radiation-based differentiation. 
In each of these cases, the particles become distinguishable, and the coarse description 
becomes inadequate. 

If we cannot separate the species to some extent, we cannot prepare equilibrium ensembles 
at fixed mole fraction. But this would be necessary to calibrate the chemical potentials, since 
fixed chemical potentials can be prepared only through chemical contact with substances 
with known chemical potentials, and the latter must be computed from mole fractions. 

Generalizing from the example, we conclude that even when both a coarse model and a more 
detailed model are faithful to all experimental information possible at a given description 
level, there is no guarantee that they agree in the values of all thermal variables of the coarse 
model. In the language of control theory (see, e.g., Ljung [82]), agreement is guaranteed 
only when all parameters of the more detailed models are observable. 

On the other hand, all observable state functions of the detailed system which depend only 
on the coarse state must and will have the same value within the experimental accuracy 
if both models are adequate descriptions of the situation. Thus, while the values of some 
variables need not be experimentally determinable, the validity of a model is an objective 
property. Therefore, preferences for one or the other of two valid models can only be based 
on other criteria. The criterion usually employed in this case is Ockham's razor^, although
there may be differences of opinion on what counts as the most economic model. In
particular, a fundamental description of macroscopic matter by means of quantum mechanics is
hopelessly overspecified in terms of the number of degrees of freedom needed for compar- 
ison with experiment, most of which are in principle unobservable by equipment made of 
ordinary matter. But it is often the most economical model in terms of description length 
(though extracting the relevant information from it may be difficult). Thus, different people 
may well make different rational choices, or employ several models simultaneously. 

The objectivity of a model description implies that, as soon as a discrepancy with
experiment is reliably found, the model must be replaced by a more detailed (or altogether
different) model. This is indeed what happened with the textbook example of the Gibbs
paradox situation, ortho and para hydrogen, cf. Bonhoeffer & Harteck [20], Farkas
[13]. Hydrogen seemed at first to be a single substance, but then thermodynamic data
forced a refined description.

Similarly, in spin echo experiments (see, e.g., Hahn ^EH], Rothstein [114],
Ridderbos & Redhead [109]), the specially prepared system appears to be in equilibrium but,
according to Callen's empirical definition^, it is not - the surprising future behavior (for
someone not knowing the special preparation) shows that some correlation variables were
neglected that are needed for a correct description. Indeed, everywhere in science, we strive
for explaining surprising behavior by looking for the missing variables needed to describe
the system correctly!

Grad [51] expresses this as "the adoption of a new entropy is forced by the discovery of new
information". More precisely, the adoption of a new model is forced, since the old model
is simply wrong under the new conditions and remains valid only under some restrictions.
Thus this is not a property of entropy alone, but of all concepts in models of reality relating
to effects not observable (in the sense of control theory discussed above).

Observability issues aside, the coarser description usually has a more limited range of
applicability; with the qualification discussed in the example, it is generally restricted to those
systems whose detailed intensive variable vector $\alpha$ is close to the subspace of vectors of the
form $C^T\alpha_c$ reproducible in the coarse model. Finding the right family of thermal variables
is therefore a matter of discovery, not of subjective choice. This is further discussed in
Section 13.



10 The first law: Energy balance 

We now discuss relations between changes of the values of extensive or intensive variables,
as expressed by the first law of thermodynamics. To derive the first law in full generality,
we use the concept of reversible transformations introduced in Section 2. Corresponding to
such a transformation, there is a family of thermal states $\langle\cdot\rangle_\lambda$ defined by

$$\langle f\rangle_\lambda = \int e^{-\beta(\lambda)(H-\alpha(\lambda)\cdot X)}\,f,\qquad \beta(\lambda) = \frac{1}{kT(\lambda)}.$$

Important: In case of local or microlocal equilibrium, where the thermal system carries
a dynamics, it is important to note that reversible transformations are fictitious
transformations which have nothing to do with how the system changes with time, or whether a
process is reversible in the dynamical sense that both the process and the reverse process
can be realized dynamically. The time shift is generally not a reversible transformation.

We use differentials corresponding to reversible transformations; writing $f = S/k$, we can
delete the index $f$ from the formulas in Section |H. In particular, we write the Kubo inner
product (32) as

$$\langle g;h\rangle := \langle g;h\rangle_{S/k}. \qquad (82)$$

10.1 Proposition. The value $\bar g(T,\alpha) := \langle g(T,\alpha)\rangle$ of every (possibly $T$- and $\alpha$-dependent)
quantity $g(T,\alpha)$ is a state variable satisfying the differentiation formula

$$d\langle g\rangle = \langle dg\rangle - \langle g-\bar g;dS\rangle/k. \qquad (83)$$



Proof. That $\bar g$ is a state variable is an immediate consequence of the zeroth law (57), since
the entropy depends on $T$ and $\alpha$ only. The differentiation formula follows from ( H3l) and


10.2 Theorem. For reversible changes, we have the first law of thermodynamics

$$d\bar H = T\,d\bar S + \alpha\cdot d\bar X \qquad (84)$$

and the Gibbs-Duhem equation

$$0 = \bar S\,dT + \bar X\cdot d\alpha. \qquad (85)$$



Proof. Differentiating the equation of state (60), using the chain rule (10), and simplifying
using (61) gives the Gibbs-Duhem equation (85). If we differentiate the phenomenological
Euler equation (62), we obtain

$$d\bar H = T\,d\bar S + \bar S\,dT + \alpha\cdot d\bar X + \bar X\cdot d\alpha,$$

and using (85), this simplifies to the first law of thermodynamics. □



Because of the form of the energy terms in the first law (84), one often uses the analogy
to mechanics and calls the intensive variables generalized forces, and differentials of
extensive variables generalized displacements.






For the Gibbs-Duhem equation, we give a second proof which provides additional insight.
Since $H$ and $X$ are fixed quantities for a given system, they do not change under reversible
transformations; therefore

$$dH = 0,\quad dX = 0.$$
Differentiating the Euler equation (57) therefore gives the relation

$$0 = T\,dS + S\,dT + X\cdot d\alpha. \qquad (86)$$

On the other hand, $S$ depends explicitly on $T$ and $\alpha$, and by Corollary C.3,

$$\langle dS\rangle = \int e^{-S/k}\,dS = -k\,d\Big(\int e^{-S/k}\Big) = -k\,d1 = 0; \qquad (87)$$

taking values in (86) implies again the Gibbs-Duhem equation. Equation (86) can also be
used to get information about limit resolutions.

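10.3 Theorem.

(i) Let $g$ be a quantity depending continuously differentiably on the intensive variables $T$
and $\alpha$. Then

$$\langle g-\bar g;S-\bar S\rangle = kT\Big(\frac{\partial\bar g}{\partial T} - \Big\langle\frac{\partial g}{\partial T}\Big\rangle\Big), \qquad (88)$$

$$\langle g-\bar g;X_j-\bar X_j\rangle = kT\Big(\frac{\partial\bar g}{\partial\alpha_j} - \Big\langle\frac{\partial g}{\partial\alpha_j}\Big\rangle\Big). \qquad (89)$$

(ii) If the extensive variables $H$ and $X_j$ ($j\in J$) are pairwise commuting then

$$\langle(S-\bar S)^2\rangle = kT\,\frac{\partial\bar S}{\partial T}, \qquad (90)$$

$$\langle(X_j-\bar X_j)(S-\bar S)\rangle = kT\,\frac{\partial\bar X_j}{\partial T}, \qquad (91)$$

$$\langle(X_j-\bar X_j)(X_k-\bar X_k)\rangle = kT\,\frac{\partial\bar X_j}{\partial\alpha_k}\quad (j,k\in J), \qquad (92)$$

$$\mathrm{Res}(S) = \sqrt{\frac{kT}{\bar S^2}\frac{\partial\bar S}{\partial T}},\qquad \mathrm{Res}(X_j) = \sqrt{\frac{kT}{\bar X_j^2}\frac{\partial\bar X_j}{\partial\alpha_j}}, \qquad (93)$$

$$\mathrm{Res}(H) = \sqrt{\frac{kT}{\bar H^2}\Big(T\frac{\partial\bar H}{\partial T} + \alpha\cdot\frac{\partial\bar H}{\partial\alpha}\Big)}. \qquad (94)$$

Proof. Multiplying the differentiation formula (83) by $kT$ and using (86), we find, for
arbitrary reversible transformations,

$$kT\big(d\langle g\rangle - \langle dg\rangle\big) = \langle g-\bar g;S\rangle\,dT + \langle g-\bar g;X\rangle\cdot d\alpha.$$

Dividing by $d\lambda$ and choosing $\lambda = T$ and $\lambda = \alpha_j$, respectively, gives

$$\langle g-\bar g;S\rangle = kT\Big(\frac{\partial\bar g}{\partial T} - \Big\langle\frac{\partial g}{\partial T}\Big\rangle\Big),\qquad \langle g-\bar g;X_j\rangle = kT\Big(\frac{\partial\bar g}{\partial\alpha_j} - \Big\langle\frac{\partial g}{\partial\alpha_j}\Big\rangle\Big).$$

(i) follows upon noting that $\langle g-\bar g;h-\bar h\rangle = \langle g-\bar g;h\rangle$ since by (35),

$$\langle g-\bar g;\bar h\rangle = \langle g-\bar g\rangle\bar h = (\langle g\rangle-\bar g)\bar h = 0.$$

If the extensive variables $H$ and $X_j$ ($j\in J$) are pairwise commuting then we can use (36)
to eliminate the Kubo inner product, and by choosing $g$ as $S$ and $X_j$, respectively, we
find (90)-(92). The limit resolutions (93) now follow from (1551) and the observation that
$\langle(g-\bar g)^2\rangle = \langle(g-\bar g)g\rangle - \langle g-\bar g\rangle\bar g = \langle(g-\bar g)g\rangle = \langle g^2\rangle - \bar g^2$. The limit resolution (94) follows
similarly from

$$\bar H^2\,\mathrm{Res}(H)^2 = \langle H-\bar H;H-\bar H\rangle = T\langle H-\bar H;S-\bar S\rangle + \alpha\cdot\langle H-\bar H;X-\bar X\rangle.$$

□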



Note that higher order central moments can be obtained in the same way, substituting
more complicated expressions for $g$ and using the formulas for the lower order moments to
evaluate the right hand side of (88) and (89).

The extensive variables scale linearly with the system size $\Omega$ of the system. Hence, the
limit resolution of the extensive quantities is $O(\sqrt{k/\Omega})$ in regions of the state space where
the extensive variables depend smoothly on the intensive variables. Since $k$ is very small,
they are negligible unless the system considered is very tiny. Thus, macroscopic thermal
variables can generally be obtained with fairly high precision. An exception is close to
critical points where the extensive variables are not differentiable, and their derivatives can
therefore become huge. In particular, in the thermodynamic limit $\Omega\to\infty$, uncertainties
are absent except close to a critical point, where they lead to critical opacity.
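The fluctuation formulas and the scaling of the limit resolution can be checked on a toy model. The sketch below is an illustration under assumptions not made in the text: a classical system with finitely many energy levels and no further extensive variables, for which the energy fluctuation formula reduces to the variance of $H$ being $kT^2\,d\bar H/dT$; the limit resolution is taken as standard deviation divided by the absolute mean; a larger system is modeled as independent copies; $k = 1$; names are hypothetical.

```python
import math

K_B = 1.0  # assumption: units in which the Boltzmann constant is 1


def energy_moments(levels, T):
    """Return (mean, variance) of the energy in the thermal state at temperature T."""
    weights = [math.exp(-e / (K_B * T)) for e in levels]
    z = sum(weights)
    mean = sum(e * w for e, w in zip(levels, weights)) / z
    var = sum((e - mean) ** 2 * w for e, w in zip(levels, weights)) / z
    return mean, var


def fluctuation_rhs(levels, T, h=1e-5):
    """k T^2 d(mean energy)/dT, with the derivative taken by central differences."""
    up, _ = energy_moments(levels, T + h)
    down, _ = energy_moments(levels, T - h)
    return K_B * T * T * (up - down) / (2.0 * h)


def limit_resolution(levels, T, copies=1):
    """Relative uncertainty of the total energy of `copies` independent subsystems."""
    mean, var = energy_moments(levels, T)
    return math.sqrt(copies * var) / abs(copies * mean)
```

Increasing `copies` by a factor of 100 reduces the returned resolution by a factor of 10, which is the square-root scaling with system size described above.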

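10.4 Corollary. For a standard thermal system,

$$\mathrm{Res}(S) = \sqrt{\frac{kT}{\bar S^2}\frac{\partial\bar S}{\partial T}},\qquad \mathrm{Res}(N_j) = \sqrt{\frac{kT}{\bar N_j^2}\frac{\partial\bar N_j}{\partial\mu_j}}, \qquad (95)$$

$$\mathrm{Res}(H) = \sqrt{\frac{kT}{\bar H^2}\Big(T\frac{\partial\bar H}{\partial T} + P\frac{\partial\bar H}{\partial P} + \mu\cdot\frac{\partial\bar H}{\partial\mu}\Big)}. \qquad (96)$$

Proof. Use (90) and (92). □

Note that $\mathrm{Res}(V) = 0$ since we regarded $V$ as a number.

11 The second law: Extremal principles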



The extremal principles of the second law of thermodynamics assert that in a nonthermal 
state, some energy expression depending on one of a number of standard boundary condi- 
tions is strictly larger than that of related thermal states. The associated thermodynamic 
potentials can be used in place of the system function to calculate all thermal variables 
given half of them. Thus, like the system function, thermodynamic potentials give a com- 
plete summary of the equilibrium properties of homogeneous materials. We only discuss 
the Hamilton potential 

$$U(S,X) := \max_{T,\alpha}\,\{TS + \alpha\cdot X \mid \Delta(T,\alpha) = 0,\ T > 0\}$$

and the Helmholtz potential

$$A(T,X) := \max_{\alpha}\,\{\alpha\cdot X \mid \Delta(T,\alpha) = 0\};$$

other potentials can be handled in a similar way. 

11.1 Theorem. (Second law of thermodynamics)

(i) In an arbitrary state,

$$\bar H \ge U(\bar S,\bar X),$$

with equality iff the state is a thermal state of positive temperature. The remaining thermal
variables are then given by

$$T = \frac{\partial U}{\partial S}(\bar S,\bar X),\qquad \alpha = \frac{\partial U}{\partial X}(\bar S,\bar X), \qquad (97)$$

$$U = \bar H = U(\bar S,\bar X). \qquad (98)$$

In particular, a thermal state of positive temperature is uniquely determined by the values
of $S$ and $X$.

(ii) Let $T > 0$. Then, in an arbitrary state,

$$\bar H - T\bar S \ge A(T,\bar X),$$

with equality iff the state is a thermal state of temperature $T$. The remaining thermal
variables are then given by

$$\bar S = -\frac{\partial A}{\partial T}(T,\bar X),\qquad \alpha = \frac{\partial A}{\partial X}(T,\bar X), \qquad (99)$$

$$\bar H = T\bar S + \alpha\cdot\bar X = A(T,\bar X) + T\bar S. \qquad (100)$$

In particular, a thermal state of positive temperature is uniquely determined by the values
of $T$ and $X$.

Proof. This is proved in the same way as Theorem 3.2; thus we give no details. □



The additivity of extensive quantities is again reflected in corresponding properties of the 
thermodynamic potentials: 

11.2 Theorem.

(i) The function $U(S,X)$ is a convex function of its arguments which is positive
homogeneous of degree 1, i.e., for real $\lambda, \lambda^1, \lambda^2 \ge 0$,

$$U(\lambda S,\lambda X) = \lambda U(S,X), \qquad (101)$$

$$U(\lambda^1S^1 + \lambda^2S^2,\ \lambda^1X^1 + \lambda^2X^2) \le \lambda^1U(S^1,X^1) + \lambda^2U(S^2,X^2). \qquad (102)$$

(ii) The function $A(T,X)$ is a convex function of $X$ which is positive homogeneous of degree
1, i.e., for real $\lambda, \lambda^1, \lambda^2 \ge 0$,

$$A(T,\lambda X) = \lambda A(T,X), \qquad (103)$$

$$A(T,\ \lambda^1X^1 + \lambda^2X^2) \le \lambda^1A(T,X^1) + \lambda^2A(T,X^2). \qquad (104)$$

Proof. This is proved in the same way as Theorem 3.3; thus we give no details. □
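The definition of the Helmholtz potential as a maximum, and the properties (103)-(104), can be illustrated on an ideal gas with one species. This is a sketch under assumptions that go beyond the text: the equation of state is taken as $P(T,\mu) = \pi(T)e^{\mu/kT}$ (the form implied by the ideal gas chemical potential $\mu = kT\log(kTN/(V\pi(T)))$), the function `pi_T` is an arbitrary hypothetical choice, $k = 1$, and the maximization over $\mu$ is done by a simple ternary search.

```python
import math

K_B = 1.0  # assumption: units in which the Boltzmann constant is 1


def pi_T(T):
    """Hypothetical temperature-dependent factor; any positive function would do."""
    return T ** 2.5


def pressure(T, mu):
    """Assumed ideal gas equation of state P(T, mu) = pi(T) exp(mu / kT)."""
    return pi_T(T) * math.exp(mu / (K_B * T))


def helmholtz_closed(T, V, N):
    """Closed form kT N (log(kT N / (V pi(T))) - 1) of the maximum computed below."""
    return K_B * T * N * (math.log(K_B * T * N / (V * pi_T(T))) - 1.0)


def helmholtz_max(T, V, N, iters=200):
    """Maximize -P(T, mu) V + mu N over mu (concave in mu) by ternary search."""
    lo, hi = -50.0 * K_B * T, 50.0 * K_B * T

    def objective(mu):
        return -pressure(T, mu) * V + mu * N

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if objective(m1) < objective(m2):
            lo = m1
        else:
            hi = m2
    return objective(0.5 * (lo + hi))
```

The search bounds of 50 $kT$ on either side are an arbitrary choice that covers moderate densities; `helmholtz_closed` can then be used to test homogeneity and convexity in $(V, N)$ directly.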



The extremal principles imply energy dissipation properties for time-dependent states. 
Since the present kinematical setting does not have a proper dynamical framework, it is 
only possible to outline the implications without going much into details. 

11.3 Theorem.

(i) For any time-dependent system for which $\bar S$ and $\bar X$ remain constant and which converges
to a thermal state with positive temperature, the Hamilton energy $\langle H\rangle$ attains its global
minimum in the limit $t\to\infty$.

(ii) For any time-dependent system maintained at fixed temperature $T > 0$, for which $\bar X$
remains constant and which converges to a thermal state, the Helmholtz energy $\langle H - TS\rangle$
attains its global minimum in the limit $t\to\infty$.

Proof. This follows directly from Theorem 11.1. □

This result is the shadow of a more general, dynamical observation (that, of course, cannot 
be proved from kinematic assumptions alone but would require a dynamical theory). Indeed, 
it is a universally valid empirical fact that in all natural time-dependent processes, energy is 
lost or dissipated, i.e., becomes macroscopically unavailable, unless compensated by energy 
provided by the environment. Details go beyond the present framework, which adopts a
strictly kinematic setting.

12 The third law: Quantization



The third law of thermodynamics asserts that the value of the entropy is always nonnegative.
But it cannot be deduced from our axioms without making a further assumption, as a simple
example demonstrates.

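12.1 Example. The algebra $\mathbb{E} = \mathbb{C}^N$ with pointwise operations is a Euclidean *-algebra
for any integral of the form

$$\int f = \frac{1}{N}\sum_{n=1}^{N} w_nf_n \quad (w_n > 0);$$

the axioms are trivial to verify. For this integral the state defined by

$$\langle f\rangle = \frac{1}{N}\sum_{n=1}^{N} f_n$$

is a state with entropy $S$ given by $S_n = k\log w_n$. The value of the entropy

$$\bar S = \frac{1}{N}\sum_{n=1}^{N} S_n = \frac{k}{N}\sum_{n=1}^{N}\log w_n$$

is negative if we choose the $w_n$ such that $\prod w_n < 1$.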



Thus, we need an additional condition which guarantees the validity of the third law. Since 
the third law is also violated in classical statistical mechanics, which is a particular case 
of the present setting, we need a condition which forbids the classical interpretation of our 
axioms. 

We take our inspiration from a simple information theoretic model of states discussed in
Appendix A, which has this property. (Indeed, the third law is a necessary requirement
for the interpretation of the value of the entropy as a measure of internal complexity, as
discussed there.) There, the integral is a sum over the components, and, since functions
were defined componentwise,

$$\int F(f) = \sum_{n \in \mathcal{N}} F(f_n). \tag{105}$$

We say that a quantity $f$ is quantized iff (105) holds with a suitable spectrum
$\{f_n \mid n \in \mathcal{N}\}$ for all functions $F$ for which $F(f)$ is strongly integrable; in this case,
the $f_n$ are called the levels of $f$. For example, in the quantum setting all trace class linear
operators are quantized quantities, since these always have a discrete spectrum.

Quantization is the additional ingredient needed to derive the third law: 






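12.2 Theorem. (Third law of thermodynamics)

If the entropy $S$ is quantized then $\bar S \ge 0$. Equality holds iff the entropy has a single level
only ($|\mathcal{N}| = 1$).

Proof. We have

$$1 = \int \rho = \int e^{-S/k} = \sum_{n \in \mathcal{N}} e^{-S_n/k} = \sum_{n \in \mathcal{N}} \rho_n, \tag{106}$$

where all $\rho_n = e^{-S_n/k} > 0$, and

$$\bar S = \int S \rho = \int S e^{-S/k} = \sum_{n \in \mathcal{N}} S_n e^{-S_n/k} = \sum_{n \in \mathcal{N}} S_n \rho_n. \tag{107}$$

If $\mathcal{N} = \{n\}$ then (106) implies $\rho_n = 1$, $S_n = 0$, and (107) gives $\bar S = 0$. And if $|\mathcal{N}| > 1$ then
(106) gives $\rho_n < 1$, hence $S_n > 0$ for all $n \in \mathcal{N}$, and (107) implies $\bar S > 0$. □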
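The argument of the proof can be checked numerically. The following is a minimal sketch, not part of the original derivation: it takes levels $\rho_n$ summing to one as in (106), forms $S_n = -k \log \rho_n$, and evaluates (107). For contrast it also evaluates the weighted integral of Example 12.1, read here as giving the value $(k/N) \sum_n \log w_n$; that reading, the choice $k = 1$, and all function names and sample numbers are assumptions made for illustration.

```python
import math

K = 1.0  # Boltzmann constant in arbitrary units (assumption for illustration)


def mean_entropy_quantized(rho):
    """Value of the entropy sum_n S_n rho_n with S_n = -K log rho_n, cf. (106)-(107)."""
    assert all(r > 0 for r in rho)
    assert abs(sum(rho) - 1.0) < 1e-12  # normalization (106)
    return sum(-K * math.log(r) * r for r in rho)


def mean_entropy_weighted(w):
    """Value of the entropy for the weighted integral of Example 12.1 (assumed reading)."""
    return K * sum(math.log(x) for x in w) / len(w)


single = mean_entropy_quantized([1.0])               # one level only
several = mean_entropy_quantized([0.5, 0.25, 0.25])  # more than one level
negative = mean_entropy_weighted([0.5, 0.5, 1.0])    # product of weights below 1

print(single, several, negative)
```

With a single level the value vanishes, with several levels it is strictly positive, and the non-quantized weighted integral produces a negative value, in line with the theorem and the example.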



In quantum chemistry, energy $H$, volume $V$, and particle numbers $N_1, \ldots, N_s$ form a
quantized family of pairwise commuting Hermitian variables. Indeed, the Hamiltonian $H$ has
discrete energy levels if the system is confined to a finite volume; $V$ is a number, hence has
a single level only; and $N_j$ counts particles, hence has as levels the nonnegative integers. As
a consequence, the entropy $S = T^{-1}(H + PV - \mu \cdot N)$ is quantized, too, so that the third
law of thermodynamics is valid. The number of levels is infinite, so that the value of the
entropy is positive.

A zero value of the entropy (absolute zero) is therefore an idealization which cannot be
realized in practice. But Theorem 12.2 implies in this idealized situation that entropy and
hence the joint spectrum of $(H, V, N_1, \ldots, N_s)$ can have a single level only.

This is the situation discussed in ordinary quantum mechanics (pure energy states at fixed
particle numbers). It is usually associated with the limit $T \to 0$, though at absolute
temperature $T = 0$, i.e., infinite coldness $\beta$, the thermal formalism fails (but low $T$ asymptotic
expansions are possible).

To see the behavior close to this limit, we consider for simplicity a canonical ensemble with
Hamiltonian $H$ (Example 4.4); thus the particle number is fixed. Since $S$ is quantized, the
spectrum of $H$ is discrete, so that there is a finite or infinite sequence $E_0 < E_1 < E_2 < \ldots$
of distinct energy levels. Denoting by $P_n$ the (rank $d_n$) orthogonal projector to the
$d_n$-dimensional eigenspace with energy $E_n$, we have the spectral decomposition

$$\phi(H) = \sum_{n \ge 0} \phi(E_n) P_n$$

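for arbitrary functions $\phi$ defined on the spectrum. In particular,

$$e^{-\beta H} = \sum_{n \ge 0} e^{-\beta E_n} P_n.$$

The partition function is

$$Z = \operatorname{tr} e^{-\beta H} = \sum_{n \ge 0} e^{-\beta E_n} \operatorname{tr} P_n = \sum_{n \ge 0} e^{-\beta E_n} d_n.$$

As a consequence,

$$e^{-S/k} = Z^{-1} e^{-\beta H} = \frac{\sum_{n \ge 0} e^{-\beta E_n} P_n}{\sum_{n \ge 0} e^{-\beta E_n} d_n} = \frac{\sum_{n \ge 0} e^{-\beta (E_n - E_0)} P_n}{\sum_{n \ge 0} e^{-\beta (E_n - E_0)} d_n},$$

hence values take the form

$$\langle f \rangle = \operatorname{tr} e^{-S/k} f = \frac{\sum_{n \ge 0} e^{-\beta (E_n - E_0)} \operatorname{tr} P_n f}{\sum_{n \ge 0} e^{-\beta (E_n - E_0)} d_n}. \tag{108}$$

From this representation, we see that only the energy levels $E_n$ with

$$E_n \le E_0 + O(kT)$$

contribute to a canonical ensemble of temperature $T$. If the temperature $T$ is small enough,
so that $kT \ll E_2 - E_0$, the exponentials $e^{-\beta (E_n - E_0)}$ with $n \ge 2$ can be neglected, and we
find

$$e^{-S/k} \approx \frac{P_0 + e^{-\beta (E_1 - E_0)} P_1}{d_0 + e^{-\beta (E_1 - E_0)} d_1} = \frac{P_0}{d_0} - \frac{d_1 P_0 - d_0 P_1}{d_0 \left( e^{\beta (E_1 - E_0)} d_0 + d_1 \right)}. \tag{109}$$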

Thus, the system behaves essentially as the two level system discussed in Examples 4.4 and
8.5; the spectral gap $E_1 - E_0$ takes the role of $E$. In particular, if already $kT \ll E_1 - E_0$,
we find that

$$e^{-S/k} = d_0^{-1} P_0 + O(e^{-\beta (E_1 - E_0)}) \approx d_0^{-1} P_0 \quad (\text{if } kT \ll E_1 - E_0)$$

is essentially the projector to the subspace of minimal energy, scaled to ensure trace one.
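How quickly the higher levels drop out can be seen by evaluating the level weights $e^{-\beta (E_n - E_0)} d_n / \sum_m e^{-\beta (E_m - E_0)} d_m$ directly. The sketch below is only an illustration: the spectrum, the degeneracies, the temperatures, and the helper name `canonical_weights` are hypothetical choices, with energies and $kT$ measured in the same arbitrary unit.

```python
import math


def canonical_weights(levels, degeneracies, kT):
    """Total weight of each energy level in the canonical ensemble at temperature kT."""
    e0 = min(levels)
    boltz = [d * math.exp(-(e - e0) / kT) for e, d in zip(levels, degeneracies)]
    z = sum(boltz)  # partition function, rescaled by exp(E_0 / kT)
    return [b / z for b in boltz]


levels = [0.0, 1.0, 2.5, 4.0]  # hypothetical E_0 < E_1 < E_2 < E_3
degs = [1, 3, 2, 1]            # hypothetical degeneracies d_n

for kT in (1.0, 0.3, 0.05):
    w = canonical_weights(levels, degs, kT)
    print(kT, [round(x, 6) for x in w])

w_two_level = canonical_weights(levels, degs, 0.3)   # kT well below E_2 - E_0
w_ground = canonical_weights(levels, degs, 0.05)     # kT well below E_1 - E_0
```

For $kT$ well below $E_2 - E_0$ only the two lowest levels carry appreciable weight, and for $kT$ well below $E_1 - E_0$ essentially all weight sits in the lowest level.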

In the nondegenerate case, where the lowest energy eigenvalue is simple, there is a
corresponding normalized eigenvector $\psi$, unique up to a phase, satisfying the Schrödinger
equation

$$H\psi = E_0 \psi, \quad \|\psi\| = 1 \quad (E_0 \text{ minimal}). \tag{110}$$

In this case, the projector is $P_0 = \psi\psi^*$ and has rank $d_0 = 1$. Thus

$$e^{-S/k} = \psi\psi^* + O(e^{-\beta (E_1 - E_0)})$$

has almost rank one, and the value takes the form

$$\langle g \rangle = \operatorname{tr} e^{-S/k} g \approx \operatorname{tr} \psi\psi^* g = \psi^* g \psi. \tag{111}$$

In the terminology of quantum mechanics, $E_0$ is the ground state energy, the solution $\psi$
of (110) is called the ground state, and $\langle g \rangle = \psi^* g \psi$ is the expectation of the observable
$g$ in the ground state.
Our derivation therefore shows that, unless the ground state is degenerate, a canonical
ensemble at sufficiently low temperature is in an almost pure state described by the quantum
mechanical ground state.

Thus, the third law directly leads to the conventional form of quantum mechanics, which can 
therefore be understood as the low temperature limit of thermodynamics. It also indicates 
when a quantum mechanical description by a pure state is appropriate, namely always when 
the gap between the ground state energy and the next energy level is significantly larger 
than the temperature, measured in units of the Boltzmann constant. (This is the typical 
situation in most of quantum chemistry and justifies the use of the Born-Oppenheimer 
approximation in the absence of level crossing; cf. Smith [116], Yarkony [133].) Moreover,
it gives the correct (mixed) form of the state in case of ground state degeneracy, and the 
form of the correction terms when the energy gap is not large enough for the ground state 
approximation to be valid. 
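One way to quantify "almost pure" is the purity $\operatorname{tr} \rho^2$ of the canonical density operator $\rho = e^{-S/k}$, which equals 1 exactly for a pure state and $1/d_0$ for the normalized projector onto a $d_0$-fold degenerate ground space. The purity is not used in the text; it is introduced here only as an illustrative diagnostic, and the spectra, temperatures, and function name below are hypothetical.

```python
import math


def purity(levels, degeneracies, kT):
    """tr(rho^2) for rho = exp(-H/kT)/Z, with eigenvalue exp(-E_n/kT)/Z of multiplicity d_n."""
    e0 = min(levels)
    boltz = [math.exp(-(e - e0) / kT) for e in levels]
    z = sum(d * b for d, b in zip(degeneracies, boltz))
    return sum(d * (b / z) ** 2 for d, b in zip(degeneracies, boltz))


gap_levels = [0.0, 1.0]  # hypothetical two-level spectrum with gap E_1 - E_0 = 1

cold_simple = purity(gap_levels, [1, 1], 0.05)      # kT far below the gap, simple ground level
cold_degenerate = purity(gap_levels, [2, 1], 0.05)  # kT far below the gap, d_0 = 2
warm_simple = purity(gap_levels, [1, 1], 1.0)       # kT comparable to the gap

print(cold_simple, cold_degenerate, warm_simple)
```

The first value is close to 1 (pure ground state), the second close to 1/2 (the mixed state forced by degeneracy), and the third is well below 1, where the correction terms of (109) matter.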

It is remarkable that thermodynamics in this way predicts the Schrödinger equation, the
relevance of the spectrum of the Hamiltonian, and the formula for quantum expectations in 
a pure state. Indeed, after adding suitable dynamical assumptions, it is possible to interpret 
all quantum mechanics from this point of view. The resulting thermal interpretation of 
quantum mechanics will be discussed in Neumaier [95] . 



13 Local, microlocal, and quantum equilibrium 

As we have seen in Section 9, when descriptions on several levels are justified empirically,
they differ significantly only in quantities which are negligible in the more detailed models, 
or by terms which are not observable in principle. Thus, the global equilibrium description 
is adequate at some resolution if and only if only small nonequilibrium forces are present, 
and a more detailed local equilibrium description will (apart from variations of the Gibbs 
paradox which should be cured on the more detailed level) agree with the global equilibrium 
description to the accuracy within which the differences in the corresponding approximations
to the entropy, as measured by the relative entropy (50), are negligible. Of course,
if the relative entropy of a thermal state relative to the true Gibbs state is large then the 
thermal state cannot be regarded as a faithful description of the true state of the system, 
and the thermal model is inadequate. 

In statistical mechanics proper (where the microscopic dynamics is given), the relevant
extensive quantities are those whose values vary slowly enough to be macroscopically
observable at a given spatial or temporal resolution (cf. Balian). Which ones must be
included is a difficult mathematical problem which has been solved only in simple
situations (such as monatomic gases) where a weak coupling limit applies. In more general
situations, the selection is currently based on phenomenological considerations, without any
formal mathematical support.
In equilibrium statistical mechanics, which describes time-independent, global equilibrium
situations, the relevant extensive quantities are the additive conserved quantities of a
microscopic system and additional variables describing order parameters that emerge from
broken symmetries or various defects not present in the ideal model. Phase equilibrium 
needs, in addition, copies of the extensive variables (e.g., partial volumes) for each phase, 
since the phases are spatially distributed, while the intensive variables are shared by all 
phases. Chemical equilibrium also accounts for exchange of atoms through a fixed list 
of permitted chemical reactions whose length is again determined by the desired resolution. 

In states not corresponding to global equilibrium, usually called non-equilibrium states,
a thermal description is still possible assuming so-called local equilibrium. There, the
natural extensive quantities are those whose values are locally additive and slowly varying
in space and time and hence reliably observable at the scales of interest. In the statistical
mechanics of local equilibrium, the thermal variables therefore become space- and
time-dependent fields (Robertson [110]). On even shorter time scales, phase space behavior
becomes relevant, and the appropriate description is in terms of microlocal equilibrium 
and position- and momentum-dependent phase space densities. Finally, on the microscopic 
level, a linear operator description in terms of quantum equilibrium is needed. 

The present formalism is still applicable to local, microlocal, and quantum equilibrium 
(though most products now become inner products in suitable function spaces), but the 
relevant quantities are now time-dependent and additional dynamical issues (relating states 
at different times) arise which are outside the scope of the present paper. 

In local equilibrium, one needs a hydrodynamic description by Navier-Stokes equations and
their generalizations; see, e.g., Beris & Edwards [16], Oettinger, Edwards et al. [38].
In the local view, one gets the interpretation of extensive variables as locally conserved
(or at least slowly varying) quantities (whence additivity) and of intensive variables as 
parameter fields, which cause non-equilibrium currents when they are not constant, driving 
the system towards global equilibrium. In microlocal equilibrium, one needs a kinetic 
description by the Boltzmann equation and its generalizations; see, e.g., Bornath et al. 
[21], Calzetta & Hu [27], Muller & Ruggeri [90]. 

Quantum equilibrium. Full microscopic dynamics must be based on quantum mechanics. 
In quantum equilibrium, the dynamics is given by quantum dynamical semigroups. We 
outline the ideas involved, in order to emphasize some issues which are usually swept under 
the carpet. 

Even when described at the microscopic level, thermal systems of sizes handled in a
laboratory are in contact with their environment, via containing walls, emitted or absorbed
radiation, etc. We therefore embed the system of interest into a bigger, completely isolated
system and assume that the quantum state of the big system is described at a fixed time by
a normalized wave function $\psi$ in some Hilbert space $\mathbb{H}$. (Assuming instead a mixed state
given by a density operator would not alter the picture significantly.) The value of a linear
operator $g$ in the big system is

$$\langle g \rangle = \psi^* g \psi; \tag{112}$$

cf. (111). The small system is defined by a Euclidean *-algebra $E$ of linear operators densely
defined on $\mathbb{H}$, composed of all meaningful expressions in field operators at arguments in the
region of interest, with integral given by the trace in the big system. Since (112), restricted
to $g \in E$, satisfies the rules (R1)-(R4) for a state, the big system induces on the system
of interest a state. By standard theorems (see, e.g., Thirring [122]), there is a unique
density operator $\rho \in E$ such that $\langle g \rangle = \int \rho g$ for all $g \in E$ with finite value. Moreover, $\rho$
is Hermitian and positive semidefinite. If 0 is not an eigenvalue of $\rho$ (which will usually be
the case), then $\langle \cdot \rangle$ is a Gibbs state with entropy $S = -k \log \rho$. To put quantum equilibrium
into the thermal setting, we need to choose as extensive variables a family spanning the
algebra $E$; then each such $S$ can be written in the form (51).

Of course, $\psi$ and hence the state $\langle \cdot \rangle$ depend on time. If the reduced system were governed
by a Schrödinger equation then $\rho$ would evolve by means of a unitary evolution; in particular,
$\bar S = \langle S \rangle = -k \operatorname{tr} \rho \log \rho$ would be time-independent. However, the system of interest does
not inherit a Schrödinger dynamics from the isolated big system; rather, the dynamics of
$\rho$ is given by an integro-differential equation with a complicated memory term, defined by
the projection operator formalism described in detail in Grabert [50]; for summaries, see
Rau & Müller [105] and Balian. In particular, one can say nothing specific about
the dynamics of $\bar S$.

In typical treatments of such reduced descriptions, one assumes that the memory decays
sufficiently fast; this so-called Markov assumption can be justified in a weak coupling
limit (Davies, Spohn [117]), which corresponds to a system of interest nearly
independent of the environment. But a typical thermal system, such as a glass of water on
a desk, is held in place by the container. Considered as a nearly independent system, the
water would behave very differently, probably diffusing into space. Thus, it is questionable
whether the Markov assumption is satisfied; a detailed investigation of the situation would
be highly desirable. I know of only a few discussions of how containers modify
the dynamics of a large quantum system; see, e.g., Lebowitz & Frisch [79], Blatt [18]
and Ridderbos [108]. One should expect a decoherence effect (Brune et al. [21]) of the
environment on the system which, for large quantum systems, is extremely strong (Zurek
[134]). A fundamental derivation should be based on quantum field theory; the so-called
exact renormalization group equations (see, e.g., Polonyi & Sailer [102], Berges)
have a thermal flavor and might be a suitable starting point.

However, simply assuming the Markov assumption as the condition for regarding the system
of interest to be effectively isolated allows one to deduce for the resulting Markov
approximation a deterministic differential equation for the density operator. The dynamics
then describes a linear quantum dynamical semigroup. All known linear quantum dynamical
semigroups (cf. Davies [37]) on a Hilbert space correspond to a dynamics
in the form of a Lindblad equation

$$\dot\rho = \frac{i}{\hbar} (\rho H - H^* \rho) + P^* \rho \tag{113}$$

(Lindblad, Gorini et al. [49]), where the effective Hamiltonian $H$ is a not necessarily
Hermitian operator and $P^*$ is the dual of a completely positive map $P$ of the form

$$P(f) = Q^* J(f) Q \quad \text{for all } f \in E,$$

with some linear operator $Q$ from $E$ to a second *-algebra $E'$ and some *-algebra
homomorphism $J$ from $E$ to $E'$ (Stinespring [119], Davies [37, Theorem 2.1]). Their dynamics
is inherently dissipative; for time $t \to \infty$, $P^* \rho$ tends to zero, which usually implies that,
apart from a constant velocity, the limiting state is a global equilibrium state.
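The dissipative behavior can be illustrated on the smallest nontrivial case. The sketch below integrates an equation of the form (113) for a two-level system, under assumptions that are not taken from the text: $\hbar = 1$, $J$ is the identity, $Q = \sqrt{\gamma}\,|g\rangle\langle e|$ is a single decay operator, and the effective Hamiltonian is $H = H_0 + \tfrac{i}{2} Q^* Q$ with $H_0 = \omega |e\rangle\langle e|$, chosen so that the trace of $\rho$ is conserved. A plain explicit Euler step is used for simplicity, and all names and parameter values are hypothetical.

```python
import math


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def dagger(a):
    """Conjugate transpose."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]


def combine(*terms):
    """Linear combination of matrices, given as (coefficient, matrix) pairs."""
    n = len(terms[0][1])
    return [[sum(c * m[i][j] for c, m in terms) for j in range(n)] for i in range(n)]


def dissipator(rho, q):
    """The term P* rho = Q rho Q* (dual of f -> Q* f Q, with J the identity)."""
    return matmul(matmul(q, rho), dagger(q))


def rhs(rho, h, q):
    """Right-hand side i (rho H - H* rho) + P* rho, in units with hbar = 1."""
    return combine((1j, matmul(rho, h)), (-1j, matmul(dagger(h), rho)), (1, dissipator(rho, q)))


gamma, omega = 1.0, 1.0                              # hypothetical decay rate and level spacing
q = [[0j, math.sqrt(gamma) + 0j], [0j, 0j]]          # index 0 = ground, index 1 = excited
h = [[0j, 0j], [0j, omega + 0.5j * gamma]]           # H = H_0 + (i/2) Q* Q
rho = [[0.5 + 0j, 0.5 + 0j], [0.5 + 0j, 0.5 + 0j]]   # initial pure superposition

dt, steps = 1e-3, 20000
for _ in range(steps):
    rho = combine((1, rho), (dt, rhs(rho, h, q)))    # explicit Euler step

trace = (rho[0][0] + rho[1][1]).real
excited = rho[1][1].real
residual = max(abs(x) for row in dissipator(rho, q) for x in row)
print(trace, excited, residual)
```

In this toy model the trace stays equal to one, the excited-state population and the term $P^* \rho$ decay to zero, and the state approaches the ground state, which is the equilibrium state of this zero-temperature environment.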

Thus, the irreversibility of the time evolution is apparent already at the quantum level, 
being caused by the fact that all our observations are done in a limited region of space. 
The prevalence here on earth of matter in approximate equilibrium could therefore possibly 
be explained by the fact that the earth and with it most of its materials are extremely old. 

For a system reasonably isolated (in the thermodynamical sense) from its environment, one 
would expect H to contain a confining effective potential well and P to be small. It would 
be interesting to understand the conditions (if there are any) under which the dissipation 
due to P can be neglected. 

We now consider relations within the hierarchy of the four levels. The quantum equilibrium
entropy $S_{qu}$, the microlocal equilibrium entropy $S_{ml}$, the local equilibrium entropy $S_{lc}$, and the
global equilibrium entropy $S_{gl}$ denote the values of the entropy in a thermal description of
the corresponding equilibrium levels. The four levels have a more and more restricted set
of extensive quantities, and the relative entropy argument of Theorem 5.3 can be applied
at each level. Therefore

$$S_{qu} \le S_{ml} \le S_{lc} \le S_{gl}. \tag{114}$$

In general the four entropies might have completely different values. There are four
essentially different possibilities,

(i) $S_{qu} \approx S_{ml} \approx S_{lc} \approx S_{gl}$,

(ii) $S_{qu} \approx S_{ml} \approx S_{lc} \ll S_{gl}$,

(iii) $S_{qu} \approx S_{ml} \ll S_{lc} \le S_{gl}$,

(iv) $S_{qu} \ll S_{ml} \le S_{lc} \le S_{gl}$,

with different physical interpretations. As we have seen in Section 9, a thermal description
is valid only if the entropy in this description approximates the true entropy sufficiently well.
All other entropies, when significantly different, do not correspond to a correct description
of the system; their disagreement simply means failure of the coarser description to match
reality. (Again, we disregard variations of the Gibbs paradox which should be cured on the
fundamental level.) Thus which of the cases (i)-(iv) occurs decides which descriptions
are valid. (i) says that the state is in global equilibrium, and all four descriptions are valid.
(ii) says that the state is in local, but not in global equilibrium, and only the three remaining
descriptions are valid. (iii) says that the state is in microlocal, but not in local equilibrium,
and in particular not in global equilibrium; only the basic (quantum) and the microlocal
descriptions are valid. Finally, (iv) says that the state is not even in microlocal equilibrium,
and only the quantum description is valid.

Thus (assuming that the fundamental limitations in observability are correctly treated on the
quantum level), the entropy is an objective quantity, independent of the level of accuracy
with which we are able to describe the system, although the precise value it gets in a model
of course depends on the accuracy of the model. The observation (by Grad [51], Balian,
and others) that entropy may depend significantly on the description level is explained
by two facts which hold for models of any kind, not just for entropy, namely:

(i) that if two models disagree in their predictions, at most one of them can be correct,
and

(ii) that if two models agree in their predictions, the more detailed model has unobservable 
details. 

Since unobservable details cannot be put to an experimental test, the more detailed model in 
case (ii) is questionable unless dictated by fundamental considerations of formal simplicity. 



A Appendix: Entropy and unobservable complexity 

The concept of entropy is usually introduced either historically by the Carnot cycle and an 
assumed informal form of the second law of thermodynamics, or, following a more recent 
(1957) subjectivistic approach (Jaynes [62, 63]), by the information available to an observer
and the second law in form of the principle of maximum entropy. 

In our treatment, we avoided both, rejecting them as being principles that require explana- 
tion themselves. The Carnot cycle is clearly inappropriate in a foundational setting, and a 
subjectivist approach is inappropriate for the description of a world that existed long before 
there were human observers. 

However, to connect the traditional approach with the present setting, we present in this 
section an informal example of a simple stochastic model in which the entropy indeed has 
an information theoretical interpretation and then discuss what this can teach us about a 
probability-free macroscopic view of the situation. 

Suppose that we have a simple stationary device which, in regular intervals, delivers a
reading $n$ from a countable set $\mathcal{N}$ of possible readings. For example, the device might count
the number of events of a certain kind in fixed periods of time; then $\mathcal{N} = \{0, 1, 2, \ldots\}$.

We suppose that, by observing the device in action for some time, we are led to some
conjecture about the (expected) relative frequencies $p_n$ of readings $n \in \mathcal{N}$; since the device
is stationary, these relative frequencies are independent of time. (If $\mathcal{N}$ is finite and not
too large, we might take averages and wait until these stabilize to a satisfactory degree; if
$\mathcal{N}$ is large or infinite, most $n \in \mathcal{N}$ will not have been observed, and our conjecture must
depend on educated guesses. This introduces some subjectiveness and is the reason why
the following material is deferred to an appendix.)

Clearly, in order to have a consistent interpretation of the $p_n$ as relative frequencies, we
need to assume that each reading is possible:

$$p_n > 0 \quad \text{for all } n \in \mathcal{N}, \tag{115}$$

and some reading occurs with certainty:

$$\sum_{n \in \mathcal{N}} p_n = 1. \tag{116}$$

(For reasons of economy, we shall not allow $p_n = 0$ in (115), which would correspond to
readings that are either impossible, or occur too rarely to have a scientific meaning. Clearly,
this is no loss of generality.) Knowing relative frequencies only means that (when $|\mathcal{N}| > 1$)
we only have incomplete information about future readings of the device. We want to
calculate the information deficit by counting the expected number of questions needed to
identify a particular reading unknown to us, but known to someone else who may answer
our questions with yes or no.

Consider an arbitrary strategy for asking questions, and denote by $s_n$ the number of
questions needed to determine the reading $n$. With $q$ questions we can distinguish up to $2^q$
different cases; but since reading $n$ is already determined after $s_n$ questions, reading $n$ is
obtained in $2^{q - s_n}$ of the $2^q$ cases (when $s_n \le q$). Thus

$$\sum_{s_n \le q} 2^{q - s_n} \le 2^q.$$

If we divide by $2^q$ and then make $q$ arbitrarily large we find that

$$\sum_{n \in \mathcal{N}} 2^{-s_n} \le 1. \tag{117}$$

It is not difficult to construct strategies realizing the $s_n$ whenever (117) holds.
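One standard way to carry out such a construction for integer question counts is the canonical prefix code: sort the readings by the number of questions and assign consecutive binary strings, so that no answer string is a prefix of another. The $k$-th question is then "is the $k$-th character of the string of the unknown reading a 1?". The sketch below is one possible construction, not necessarily the one the author has in mind; the function names and sample counts are hypothetical.

```python
from fractions import Fraction


def kraft_sum(lengths):
    """Exact value of sum_n 2^(-s_n), the left-hand side of (117)."""
    return sum(Fraction(1, 2 ** s) for s in lengths)


def prefix_code(lengths):
    """Assign to reading n a yes/no answer string of length s_n (1 = yes, 0 = no)."""
    if kraft_sum(lengths) > 1:
        raise ValueError("question counts violate inequality (117)")
    order = sorted(range(len(lengths)), key=lambda n: lengths[n])
    codes = [None] * len(lengths)
    value, prev = 0, 0
    for n in order:
        s = lengths[n]
        value <<= s - prev  # pad the running counter to the new length
        codes[n] = format(value, "b").zfill(s) if s > 0 else ""
        value += 1          # next free string of this length
        prev = s
    return codes


def is_prefix_free(codes):
    """True if no answer string is a prefix of a different one."""
    return not any(
        i != j and codes[j].startswith(codes[i])
        for i in range(len(codes))
        for j in range(len(codes))
    )


counts = [3, 1, 3, 2]        # hypothetical question counts s_n
codes = prefix_code(counts)
print(codes, is_prefix_free(codes))
```

Because the strings are prefix-free, reading $n$ is identified after exactly $s_n$ answers, and the construction succeeds precisely when (117) holds.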

Since we do not know in advance the reading, we cannot determine the precise number
of questions needed in a particular unknown case. However, knowledge of the relative
frequencies allows us to compute the average number of questions needed, namely

$$\bar s = \sum_{n \in \mathcal{N}} p_n s_n. \tag{118}$$

To simplify notation, we introduce the abbreviation

$$\int f := \sum_{n \in \mathcal{N}} f_n \tag{119}$$

for every quantity $f$ indexed by the elements from $\mathcal{N}$, and we use the convention that
inequalities, operations and functions of such quantities are understood componentwise.
Then we can rewrite (115)-(118) as

$$p > 0, \quad \int p = 1, \tag{120}$$

$$\bar s = \int p s, \quad \int 2^{-s} \le 1, \tag{121}$$

and

$$\bar f = \langle f \rangle := \int p f \tag{122}$$

is the average of an arbitrary quantity $f$ indexed by $\mathcal{N}$.



We now ask for a strategy which makes the number $\bar s$ as small as possible. However, we
idealize the situation a little by allowing the $s_n$ to be arbitrary nonnegative real numbers
instead of integers only. This is justified when the size of $\mathcal{N}$ is large or infinite since then
most $s_n$ will be large numbers which can be approximated by integers with a tiny relative
error.

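A.1 Theorem. The entropy $S$, defined by

$$S := -k \log p, \quad \text{where } k = \frac{1}{\log 2}, \tag{123}$$

satisfies $\bar S \le \bar s$, with equality if and only if $s = S$.

Proof. (123) implies $\log p = -S \log 2$, hence $p = 2^{-S}$. Therefore

$$2^{-s} = p\, 2^{S - s} = p\, e^{\log 2\, (S - s)} \ge p\, (1 + \log 2\, (S - s)),$$

with equality iff $S = s$. Thus

$$p (S - s) \le \frac{1}{\log 2} (2^{-s} - p) = k (2^{-s} - p)$$

and

$$\bar S - \bar s = \int p (S - s) \le \int k (2^{-s} - p) = k \int 2^{-s} - k \int p \le k - k = 0.$$

Hence $\bar s \ge \bar S$, and equality holds iff $s = S$. □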
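The inequality can be checked on small examples. The sketch below compares the mean entropy with the mean number of questions for two admissible integer strategies; the frequencies, the strategies, and the helper names are hypothetical choices made for illustration. With $k = 1/\log 2$, the entropy levels are simply $S_n = -\log_2 p_n$.

```python
import math


def entropy_levels(p):
    """S_n = -k log p_n with k = 1/log 2, i.e. S_n = -log2 p_n, cf. (123)."""
    return [-math.log2(x) for x in p]


def mean(p, f):
    """Average of f with respect to the relative frequencies p, cf. (122)."""
    return sum(pn * fn for pn, fn in zip(p, f))


def admissible(s):
    """Check inequality (117), with a small tolerance for rounding."""
    return sum(2.0 ** (-sn) for sn in s) <= 1.0 + 1e-12


# Dyadic frequencies: the optimal strategy s = S has integer question counts.
p1 = [0.5, 0.25, 0.125, 0.125]
S1 = entropy_levels(p1)
s1 = [2, 2, 2, 2]  # a naive fixed-length strategy
mean_S1, mean_s1 = mean(p1, S1), mean(p1, s1)

# Non-dyadic frequencies: round each S_n up to the next integer.
p2 = [0.4, 0.3, 0.2, 0.1]
S2 = entropy_levels(p2)
s2 = [math.ceil(x) for x in S2]
mean_S2, mean_s2 = mean(p2, S2), mean(p2, s2)

print(mean_S1, mean_s1, mean_S2, mean_s2)
```

In both cases the strategy is admissible and needs at least as many questions on average as the entropy; rounding up costs less than one extra question on average.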
Since (123) implies the relation $p = e^{-S/k}$, we have $\langle f \rangle = \int p f = \int e^{-S/k} f$. Thus, the
expectation mapping is a Gibbs state with entropy $S$, explaining the name. Note that
$s = S$ defines an admissible strategy since

$$\sum_{n \in \mathcal{N}} 2^{-S_n} = \sum_{n \in \mathcal{N}} p_n = 1,$$

hence $2^{-S_n} \le 1$, $S_n \ge 0$ for all $n \in \mathcal{N}$. Thus, the entropy $S$ is the unique optimal
decision strategy. The expected entropy, i.e., the mean number

$$\bar S = \langle S \rangle = \int p S = -k \int p \log p \tag{124}$$

of questions needed in an optimal decision strategy, is nonnegative,

$$\bar S \ge 0. \tag{125}$$

It measures the information deficit of the device with respect to our conjecture about
relative frequencies. (Traditionally, this is simply called the entropy, while we reserve this
word for the random variable (123). Also commonly used is the name information for
$\bar S$, which invites linguistic paradoxes since ordinary language associates with information a
connotation of relevance or quality which is absent here. An important book on information 
by Brillouin [23] emphasizes this very carefully, by distinguishing absolute information 
from its human value or meaning. Katz [70] uses the phrase 'missing information'.) 

The information deficit says nothing at all about the quality of the information contained 
in the summary p of our past observations. An inappropriate p can have arbitrarily small 
information deficit and still give a false account of reality. E.g., if for some small $\varepsilon > 0$,

$$p_n = \varepsilon^{n-1} (1 - \varepsilon) \quad \text{for } n = 1, 2, \ldots, \tag{126}$$

expressing that the reading is expected to be nearly always 1 ($p_1 = 1 - \varepsilon$) and hardly ever
large, then

$$\bar S = -k \left( \log(1 - \varepsilon) + \frac{\varepsilon}{1 - \varepsilon} \log \varepsilon \right) \to 0 \quad \text{as } \varepsilon \to 0.$$

Thus the information deficit can be made very small by the choice (126) with small $\varepsilon$,
independent of whether this choice corresponds to the known facts. The real information
value of $p$ depends instead on the care with which the past observations were interpreted,
which is a matter of data analysis and not of our model of the device. If this is done badly,
our expectations will simply not be matched by reality. This shows that the entropy has
nothing to do with "our knowledge of the system" (a subjective, ill-defined notion) but
reflects objective properties of the stochastic process.
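The closed form for the information deficit of the geometric frequencies (126) follows from (124) by summing the series $\sum_n (n-1)\varepsilon^{n-1} = \varepsilon/(1-\varepsilon)^2$. The following sketch compares a direct summation of $-k \sum_n p_n \log p_n$ with the closed form $-k(\log(1-\varepsilon) + \tfrac{\varepsilon}{1-\varepsilon}\log\varepsilon)$; the function names, the truncation of the series, and the sample values of $\varepsilon$ are assumptions made for illustration.

```python
import math

K_BITS = 1.0 / math.log(2)  # k = 1/log 2, so the deficit is measured in yes/no questions


def deficit_series(eps, k=K_BITS, terms=2000):
    """Direct summation of -k sum_n p_n log p_n for p_n = eps^(n-1) (1 - eps)."""
    total = 0.0
    for n in range(1, terms + 1):
        p = eps ** (n - 1) * (1.0 - eps)
        if p == 0.0:  # remaining terms underflow and contribute nothing
            break
        total -= k * p * math.log(p)
    return total


def deficit_closed(eps, k=K_BITS):
    """Closed form -k (log(1 - eps) + eps/(1 - eps) * log eps)."""
    return -k * (math.log(1.0 - eps) + eps / (1.0 - eps) * math.log(eps))


for eps in (0.5, 0.1, 1e-3, 1e-6):
    print(eps, deficit_series(eps), deficit_closed(eps))
```

For $\varepsilon = 1/2$ the deficit is two questions on average, and it shrinks toward zero as $\varepsilon \to 0$, whether or not the device actually behaves as (126) claims.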

Relations to thermodynamics. Now suppose that the above setting happens at a very
fast, unobservable time scale, so that we can actually observe only short time averages
(122) of quantities of interest. Then $\bar f = \langle f \rangle$ simply has the interpretation of the
time-independent observed value of the quantity $f$. The information deficit simply becomes
the observed value of the entropy $S$. Since the information deficit counts the number
of optimal decisions needed to completely specify a (microscopic) situation of which we 
know only (macroscopic) observed values, the observed value of the entropy quantifies the 
intrinsic (microscopic) complexity present in the system. 

However, the unobservable high frequency fluctuations of the device do not completely
disappear from the picture. They show in the fact that generally $\overline{g^2} \ne \bar g^2$, leading to a nonzero
limit resolution (53) of Hermitian quantities. This is precisely the situation characteristic
of the traditional treatment of thermodynamics within classical equilibrium statistical
mechanics, if we assume ergodicity, i.e., that ensemble averages equal time averages. (This is
a problematic assumption; see, e.g., the discussion in Sklar [115].) There, all observed
values are time-independent, described by equilibrium thermal variables. But the underly- 
ing high-frequency motions of the atoms making up a macroscopic substance are revealed 
by nonzero limit resolutions. 

Note that even a deterministic but chaotic high frequency dynamics, viewed at longer time 
scales, looks stochastic, and exactly the same remarks about the unobservable complexity 
and the observable consequences of fluctuations apply. Even if fluctuations are observ- 
able directly, these observations are intrinsically limited by the necessary crudity of any 
actual measurement protocol. For the best possible measurements (and only for these), the 
resolution of $f$ in the experiment is given by the limit resolution Res($f$), the size of the
unavoidable fluctuations. 

Due to the quantum structure of high frequency phenomena (on an atomic or subatomic 
scale), it may, however, seem problematic to interpret the thermodynamic limit resolutions
in terms of a simple short time average of some underlying microscopic reality. Fortunately, 
as we have seen, such an interpretation is not necessary. 



B Appendix: The maximum entropy principle 

Motivated by a subjective, information theoretic approach to probability, Jaynes ([62]
for the classical case and [63] for the quantum case) used the maximum entropy
principle to derive the thermal formalism. This approach has gained considerable acceptance
in the physics community. But why should nature be concerned about the amount of
information an observer has? According to whose knowledge should it behave? Since
thermodynamics is completely observer independent, its foundations should have this feature,
too.

The present approach avoids this subjective touch and shows that a fully objective
foundation is possible. The maximum entropy principle becomes a theorem, valid (only) under
precisely specified conditions. To denote the extensive variables, we use the barless notation
of Section 2.



B.1 Theorem. (Entropy form of the second law)

In an arbitrary state of a standard thermal system,

$$S \le S(H, V, N) := \min \{ T^{-1} (H + PV - \mu \cdot N) \mid \Delta(T, P, \mu) = 0 \},$$

with equality iff the state is an equilibrium state. The remaining thermal variables are then
given by

$$T^{-1} = \frac{\partial S}{\partial H}(H, V, N), \quad T^{-1} P = \frac{\partial S}{\partial V}(H, V, N), \quad T^{-1} \mu = -\frac{\partial S}{\partial N}(H, V, N), \tag{127}$$

$$U = H = T S(H, V, N) - PV + \mu \cdot N. \tag{128}$$

Proof. This is proved in the same way as Theorem 3.2; thus we give no details. □



The only situation in which the value of the entropy must increase to reach equilibrium
is when $H$, $V$ and $N$ are kept constant. Under different constraints, the entropy is no
longer maximal. For example, if one pours milk into one's coffee, stirring mixes coffee
and milk, thus increasing complexity. Macroscopic order is restored after some time when
this increased complexity has become macroscopically inaccessible, since $T$, $P$ and $N$ are
constant in a state of minimal Gibbs energy, and not in a state of maximal entropy! More
formally, the first law shows that, for standard systems at fixed value of the particle number,
the value of the entropy decreases when $H$ or $V$ (or both) decrease reversibly; this shows
that the value of the entropy may well decrease if accompanied by a corresponding decrease
of $H$ or $V$. The same holds out of equilibrium (though our argument no longer applies);
for example, the reaction $2\,\mathrm{H_2} + \mathrm{O_2} \to 2\,\mathrm{H_2O}$ (if catalyzed) may happen spontaneously at
constant $T = 25$ °C and $P = 1$ atm, though it decreases the entropy.
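The numbers behind this example can be worked out from tabulated standard data. The sketch below uses approximate textbook values at 298.15 K and assumes liquid water as the product (the text does not specify the phase); these values and all variable names are assumptions brought in for illustration, not data from the present paper.

```python
T = 298.15  # temperature in K (25 degrees Celsius)

# Approximate standard molar entropies in J/(mol K), typical textbook values.
S_H2_GAS = 130.68
S_O2_GAS = 205.14
S_H2O_LIQUID = 69.91

# Approximate standard enthalpy of formation of liquid water in J/mol.
DH_F_H2O_LIQUID = -285.83e3

# Reaction 2 H2 + O2 -> 2 H2O (liquid), per 2 mol of water formed.
delta_S = 2 * S_H2O_LIQUID - (2 * S_H2_GAS + S_O2_GAS)  # entropy change of the system
delta_H = 2 * DH_F_H2O_LIQUID                           # enthalpy change
delta_G = delta_H - T * delta_S                         # Gibbs energy change at constant T, P

print(f"delta_S = {delta_S:.1f} J/K")
print(f"delta_H = {delta_H / 1e3:.1f} kJ")
print(f"delta_G = {delta_G / 1e3:.1f} kJ")
```

Under these assumptions the entropy of the reacting system drops by roughly 330 J/K, while the Gibbs energy drops by several hundred kJ, so at constant $T$ and $P$ the decrease of the Gibbs energy, not an increase of the entropy, signals that the reaction can proceed.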

The conditions of constant $H$, $V$ and $N$, needed to argue that entropy must increase, are not
easily realized in nature. From a fundamental point of view, they are even always violated:
The only truly isolated system is the universe as a whole; but the universe expands, i.e.,
$V$ changes. Thus, the assumption of constant $H$, $V$ and $N$ is unrealistic, and while the
second law in the form of a maximum entropy principle may be of theoretical and historical
importance, it is not the extremal principle ruling nature.

The irreversible nature of physical processes is instead manifest, irrespective of the entropy 
balance, as energy dissipation which, in a microscopic interpretation, indicates the loss 
of energy to the unmodelled microscopic degrees of freedom. Macroscopically, the global 
equilibrium states are therefore states of least free energy (the correct choice of which 
depends on the boundary condition), with the least possible freedom for change. (This 
macroscopic immutability is another intuitive explanation for the maximal macroscopic 
order in global equilibrium states.) 

The maximum entropy principle is also heralded as a rational, unprejudiced way of 
accounting for available information in incompletely known statistical models. However, it 
accounts only for information about exact expectation values, and the model produced by the 
maximum entropy principle describes the true situation correctly only if the expectations 
of the sufficient statistics of the true model are available exactly; see, e.g., Barndorff-
Nielsen pT]. Which statistics can be considered sufficient depends on the true situation 
and is difficult to assess in advance. The maximum entropy principle amounts in this sit- 
uation simply to the (often unfounded) assumption that the sufficient statistics are among 
the quantities of which one happens to know the expectation values. Moreover, questions 
about the uninformative prior which must be assumed to describe the state of complete 
ignorance affect the results of the maximum entropy principle, making the application of 
the principle ambiguous. 

B.2 Examples. (i) If we have information in the form of a large but finite sample of 
realizations $g(\omega_k)$ ($k = 1, \dots, N$) of a random variable $g$, we can obtain from it approximate 
information about all moments $\langle g^n \rangle \approx \sum_k g(\omega_k)^n / N$ ($n = 0, 1, 2, \dots$). The maximum 
entropy principle would infer that the distribution of $g$ is discrete, namely that of the sample. 

(ii) If we take as uninformative prior for a real-valued random variable $g$ the Lebesgue 
measure and only know that the mean of $g$ is 1, the maximum entropy principle does not 
produce a sensible probability distribution. If we add the knowledge of the second moment 
$\langle g^2 \rangle = 2$, we get a Gaussian distribution with mean 1 and standard deviation 1. If we then 
get to know that the random variable is in fact nonnegative and integer-valued, this cannot 
be accounted for by the principle, and the probability of obtaining a negative value remains 
large. 

(iii) But if we take as prior the discrete measure on nonnegative integers, the 'noninforma- 
tive' prior has become much more informative, the knowledge of the mean produces the 
Poisson distribution, and the knowledge of the second moment may modify this further. 

(iv) If we know that a random variable $g$ is nonnegative and has $\langle g^2 \rangle = 1$, the Lebesgue 
measure on $\mathbb{R}_+$ as noninformative prior gives for $g$ a distribution with density $\sqrt{2/\pi}\, e^{-g^2/2}$. 
But we can consider instead our knowledge about $h = g^2$, which is nonnegative and has 
$\langle h \rangle = 1$; the same noninformative prior now gives for $h$ a distribution with density $e^{-h}$. The 
distribution of $g = \sqrt{h}$ resulting from this has density $2g e^{-g^2}$. Thus the result depends 
on whether we regard $g$ or $h$ as the relevant variable. 
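
The dependence on the choice of variable in example (iv) can be checked numerically. The following sketch (function names and the midpoint-rule integration are illustrative choices, not part of the paper) integrates the two densities for g, namely sqrt(2/pi) exp(-g^2/2) and 2 g exp(-g^2), and compares their first three moments.

```python
import math

def density_from_g(g):
    # Maximum entropy density for g >= 0 with <g^2> = 1, Lebesgue prior in g.
    return math.sqrt(2 / math.pi) * math.exp(-g * g / 2)

def density_from_h(g):
    # Density of g = sqrt(h) when h = g^2 has the maximum entropy density exp(-h).
    return 2 * g * math.exp(-g * g)

def moment(density, power, upper=12.0, steps=20_000):
    # Midpoint rule for the integral of g**power * density(g) over [0, upper];
    # both densities are negligible beyond the (assumed) cutoff `upper`.
    width = upper / steps
    total = 0.0
    for i in range(steps):
        g = (i + 0.5) * width
        total += g ** power * density(g)
    return total * width

for name, rho in [("prior in g", density_from_g), ("prior in h", density_from_h)]:
    print(name, [round(moment(rho, p), 4) for p in (0, 1, 2)])
```

Both densities are normalized and satisfy the constraint on the second moment, yet their means differ: sqrt(2/pi), about 0.798, for the first and sqrt(pi)/2, about 0.886, for the second. The two routes therefore give genuinely different distributions for the same stated knowledge.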

These examples clearly demonstrate that the maximum entropy principle is an unreliable 
tool. The prior, far from being uninformative, reflects the prejudice assumed without any 
knowledge, and the choice of expectations to use reflects prior experience about which 
expectations are likely to be relevant. Thus the application of the maximum entropy principle 
becomes reliable only if one knows the desired form of the result beforehand (which was 
indeed the case with Jaynes' arguments, many years after Gibbs). Thus, the principle is 
not a suitable basis for the foundations of thermodynamics. 
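
To put a number on the statement in example (ii) that the probability of a negative value remains large, one can evaluate the Gaussian with mean 1 and second moment 2 directly. This is a minimal sketch using only the error function from the standard library; the helper name `normal_cdf` is an illustrative choice.

```python
import math

def normal_cdf(x, mean, std):
    # Cumulative distribution function of a Gaussian, via the error function.
    return 0.5 * (1 + math.erf((x - mean) / (std * math.sqrt(2))))

mean, second_moment = 1.0, 2.0
std = math.sqrt(second_moment - mean ** 2)   # variance = <g^2> - <g>^2 = 1

p_negative = normal_cdf(0.0, mean, std)
print(f"P(g < 0) = {p_negative:.4f}")
```

The result is about 0.159, so roughly one sixth of the probability mass lies on values that a nonnegative variable can never take.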

C Appendix: Some mathematical lemmas 



In this section we prove some mathematical results needed in the main text. The proofs 
are rigorous for the case $\mathbb{E} = \mathbb{C}^{N \times N}$ only; this covers the $N$-level quantum system, but also 
quantum field theory in the finite lattice approximation, and implies the results whenever 
$\mathbb{E}$ is finite-dimensional. Similar arguments work in more general situations if we use 
spectral resolutions; cf., e.g., Thirring |122j (who works in C*-algebras and von Neumann 
algebras). I would appreciate being informed about possible proofs in general that use only the 
properties of Euclidean *-algebras (and perhaps further, elementary assumptions). 

C.1 Proposition. For arbitrary quantities $f$, $g$, 

$$e^f g = g e^f \quad \text{if } f \text{ and } g \text{ commute},$$

$$f^* = f \;\Rightarrow\; \log e^f = f,$$

$$f \ge 0 \;\Rightarrow\; \sqrt{f} \ge 0, \quad (\sqrt{f})^2 = f.$$

For any quantity $f = f(s)$ depending continuously on $s \in [a, b]$, 

$$\int_a^b ds \int f(s) = \int \Big( \int_a^b ds\, f(s) \Big),$$

and for any quantity $f = f(\lambda)$ depending continuously differentiably on a parameter vector 
$\lambda$, 

$$\frac{d}{d\lambda} \int f = \int \frac{df}{d\lambda}.$$

Proof. In finite dimensions, the first four assertions are standard matrix calculus, and 
the remaining two statements hold since $\int f$ must be a finite linear combination of the 
components of $f$. □ 

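C.2 Proposition. Let $f$, $g$ be quantities depending continuously differentiably on a 
parameter vector $\lambda$, and suppose that 

$$[f(\lambda), g(\lambda)] = 0 \quad \text{for all } \lambda.$$

Then, for any continuously differentiable function $F$ of two variables, 

$$\frac{d}{d\lambda} \int F(f, g) = \int \partial_1 F(f, g) \frac{df}{d\lambda} + \int \partial_2 F(f, g) \frac{dg}{d\lambda}. \tag{129}$$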
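
Proof. We prove the special case $F(x, y) = x^m y^n$, where (129) reduces to 

$$\frac{d}{d\lambda} \int f^m g^n = \int m f^{m-1} g^n \frac{df}{d\lambda} + \int n f^m g^{n-1} \frac{dg}{d\lambda}. \tag{130}$$

The general case then follows for polynomials $F(x, y)$ by taking suitable linear combinations, 
and for arbitrary $F$ by a limiting procedure. To prove (130), we note that, more generally, 

$$\frac{d}{d\lambda} \int f_1 \cdots f_{m+n} = \sum_{j=1}^{m+n} \int f_1 \cdots f_{j-1} \frac{df_j}{d\lambda} f_{j+1} \cdots f_{m+n} = \sum_{j=1}^{m+n} \int f_{j+1} \cdots f_{m+n} f_1 \cdots f_{j-1} \frac{df_j}{d\lambda},$$

using the cyclic commutativity (EA2) of the integral. If we specialize to $f_j = f$ if $j \le m$, 
$f_j = g$ if $j > m$, and note that $f$ and $g$ commute, we arrive at (130). □ 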



Of course, the proposition generalizes to families of more than two commuting quantities; 
but more important is the special case $g = f$: 

C.3 Corollary. For any quantity $f$ depending continuously differentiably on a parameter 
vector $\lambda$, and any continuously differentiable function $F$ of a single variable, 

$$\frac{d}{d\lambda} \int F(f) = \int F'(f) \frac{df}{d\lambda}. \tag{131}$$
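
The corollary, which states that the derivative of the integral of F(f) equals the integral of F'(f) times the derivative of f, can be sanity-checked in the matrix case. The sketch below makes several assumptions that are not part of the text: the integral is taken to be the trace, F(x) = x^3, and f depends linearly on a scalar parameter through two arbitrarily chosen 2×2 symmetric matrices that do not commute. It compares a central finite difference with the right-hand side of (131).

```python
def matmul(a, b):
    # Product of two 2x2 matrices stored as nested lists.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

# Hypothetical symmetric matrices, chosen so that A and B do not commute.
A = [[1.0, 2.0], [2.0, -1.0]]
B = [[0.5, -1.0], [-1.0, 3.0]]

def f(lam):
    # f(lambda) = A + lambda * B, so that df/dlambda = B.
    return [[A[i][j] + lam * B[i][j] for j in range(2)] for i in range(2)]

def integral_of_cube(lam):
    # "Integral" of F(f) for F(x) = x^3, with the trace playing the role of the integral.
    m = f(lam)
    return trace(matmul(m, matmul(m, m)))

lam, eps = 0.3, 1e-5
# Left-hand side of (131): central finite difference in lambda.
lhs = (integral_of_cube(lam + eps) - integral_of_cube(lam - eps)) / (2 * eps)
# Right-hand side of (131): trace of F'(f) df/dlambda = 3 f^2 B.
m = f(lam)
rhs = 3 * trace(matmul(matmul(m, m), B))
print(lhs, rhs)
```

The two numbers agree up to discretization and rounding error, even though f and its derivative do not commute; only the cyclic property of the trace is needed.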

A real-valued function $\phi$ is convex in a convex set $X \subseteq \mathbb{R}^n$ if $\phi$ is defined on $X$ and, for all 
$x, y \in X$, 

$$\phi(tx + (1 - t)y) \le t\phi(x) + (1 - t)\phi(y) \quad \text{for } 0 \le t \le 1.$$

Clearly, $\phi$ is convex iff for all $x, y \in X$, the function $\mu : [0, 1] \to \mathbb{R}$ defined by 

$$\mu(t) := \phi(x + t(y - x))$$

is convex. It is well-known that, for twice continuously differentiable $\phi$, this is the case 
iff the second derivative $\mu''(t)$ is nonnegative for $0 \le t \le 1$. Note that by a theorem 
of Aleksandrov (see Aleksandrov ^, Franklin [16], Rockafellar [Il2j), convex 
functions are almost everywhere twice differentiable, in the sense that, for 
almost every $x \in X$, there exist a gradient vector $\frac{\partial}{\partial x}\phi(x) \in \mathbb{R}^n$ and a symmetric, positive semidefinite 
Hessian matrix $\frac{\partial^2}{\partial x^2}\phi(x) \in \mathbb{R}^{n \times n}$ such that, for arbitrary $h \in \mathbb{R}^n$, 

$$\phi(x + h) = \phi(x) + h^T \frac{\partial}{\partial x}\phi(x) + \frac{1}{2} h^T \frac{\partial^2}{\partial x^2}\phi(x)\, h + o(\|h\|^2).$$

A function $\phi$ is concave iff $-\phi$ is convex. Thus, for twice continuously differentiable $\phi$, $\phi$ is 
concave iff $\mu''(t) \le 0$ for $0 \le t \le 1$. 

C.4 Proposition. If $\phi$ is convex then the function $\psi$ defined by 

$$\psi(s, x) := s\,\phi(x/s)$$

is convex for $s > 0$ and concave for $s < 0$. 

Proof. It suffices to show that $\mu(t) := \psi(s + tk, x + th)$ is convex (concave) for all $s, x, h, k$ 
such that $s + tk > 0$ (resp. $< 0$). Let $z(t) := (x + th)/(s + tk)$ and $c := sh - kx$. Then 

$$z'(t) = \frac{c}{(s + tk)^2}, \qquad \mu(t) = (s + tk)\,\phi(z(t)),$$

hence 

$$\mu'(t) = k\,\phi(z(t)) + \phi'(z(t)) \frac{c}{s + tk}, \qquad \mu''(t) = \frac{c^T \phi''(z(t))\, c}{(s + tk)^3},$$

which has the required sign. □ 
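
The statement of Proposition C.4 can be illustrated with a quick numerical experiment. The sketch below assumes the scalar test function phi(x) = x^2 (any other convex choice would do) and hypothetical sampling ranges; it tests midpoint convexity of psi(s, x) = s phi(x/s) on random pairs of points with s > 0 and midpoint concavity on pairs with s < 0. It is a plausibility check, not a substitute for the proof.

```python
import random

def phi(x):
    return x * x              # a convex test function (assumed example)

def psi(s, x):
    return s * phi(x / s)     # psi(s, x) = s * phi(x / s)

def midpoint_gap(p, q):
    # psi at the midpoint minus the average of psi at the endpoints;
    # <= 0 is consistent with convexity on the segment, >= 0 with concavity.
    mid = ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    return psi(*mid) - (psi(*p) + psi(*q)) / 2

random.seed(0)

def sample(sign):
    # Random point (s, x) with s of the given sign, bounded away from zero.
    return (sign * random.uniform(0.1, 5.0), random.uniform(-5.0, 5.0))

gaps_pos = [midpoint_gap(sample(+1), sample(+1)) for _ in range(10_000)]
gaps_neg = [midpoint_gap(sample(-1), sample(-1)) for _ in range(10_000)]

tol = 1e-9   # allowance for floating-point rounding
print(max(gaps_pos) <= tol, min(gaps_neg) >= -tol)
```

Each pair is sampled with both points on the same side of s = 0, so the connecting segment never crosses s = 0, matching the condition on s + tk in the proof.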